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Abstract 

We introduce a uniform representation of general objects that captures the regu- 
larities with respect to their structure. It allows a representation of a general class of 
objects including geometric patterns and images in a sparse, modular, hierarchical, 
and recursive manner. The representation can exploit any computable regularity in 
objects to compactly describe them, while also being capable of representing random 
objects as raw data. A set of rules uniformly dictates the interpretation of the rep- 
resentation into raw signal, which makes it possible to ask what pattern a given raw 
signal contains. Also, it allows simple separation of the information that we wish to 
ignore from that which we measure, by using a set of maps to delineate the a priori 
parts of the objects, leaving only the information in the structure. 

Using the representation, we introduce a measure of information in general objects 
relative to structures defined by the set of maps. We point out that the common pre- 
scription of encoding objects by strings to use Kolmogorov complexity is meaningless 
when, as often is the case, the encoding is not specified in any way other than that it 
exists. Noting this, we define the measure directly in terms of the structures of the 
spaces in which the objects reside. As a result, the measure is defined relative to a set 
of maps that characterize the structures. Though it does not depend on Kolmogorov 
complexity, it raises a question of their relationship, as the class of applicable objects 
includes strings. It turns out that the measure is equivalent to Kolmogorov complexity 
when it is defined relative to the maps characterizing the structure of natural num- 
bers. Thus, the formulation gives the larger class of objects a meaningful measure of 
information that generalizes Kolmogorov complexity. 

1 Introduction 

What is a pattern? There does not seem to be a generally accepted mathematical definition. 
Intuitively, a pattern is something simpler than it is apparent. For instance, a repetition of 
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a short substring in a longer string is a pattern: the longer string is simpler, or contains 
less information, than most other strings of the same length. Here, we see a comparison 
between the apparent size (length in the literal representation) and the "real" amount of 
information. Formally, this can be stated in terms of Kolmogorov complexity[l, 2, 9, 13, 
14] of the string, which is roughly defined as the length of the shortest input to a universal 
Turing machine that produces the string. A string can be said to have a pattern if its 
Kolmogorov complexity is much smaller than its length: strings that can be effectively 
described in a significantly shorter description than their length have patterns. Our goal in 
this paper is to formalize this notion in the domain of more general objects than strings. 

For instance, consider bitmap images. Ordinary images are much orderly than what is 
allowed by their representation as an array of colors; if we take a random bitmap out of 
all that can be represented as a bitmap, it is almost always a white noise, rather than what 
we would consider an ordinary image. This is similar to the string case where most strings 
of a given length are random ones that do not have a shorter description than the literal 
one. What is the corresponding "effective description" of images? Intuitively, it should be 
a way to describe the image in which ordinary images can be represented more concisely 
than noise images. 

A Turing machine that produces a bitmap does not suffice because, unlike the case of 
strings, where all strings can be represented precisely as they are, the bitmaps are only 
approximations of what we consider to be real images: Pixels are artifacts of arbitrary 
approximation; and we naturally consider bitmaps of various resolutions as the same, if 
they show the same scene. There would be no problem if it were the case that all important 
features of an image are independent of the choice of pixelation. However, this is clearly 
not so: even a notion as simple as that of a line is not so simple to define on bitmaps, 
especially in such a way a line in one resolution can be converted into a line in another 
resolution. 

Infinite resolution bitmaps, or functions on an image domain that takes values in the 
color space, seem to be good enough for the literal representation. But then, the objects 
appearing there are continuous, infinite entities and thus cannot easily be described effec- 
tively as, for instance, an output of a Turing machine. Yet intuition tells us that some of 
these infinite entities contain only finite information, as the extreme cases of "geometric" 
visual patterns shown in Figure 1 . 

1.1 Kolmogorov Complexity Covers All? 

But surely, one might say, Kolmogorov complexity already covers any domain, since Com- 
puter Science teaches us that information can be encoded by strings. That is, we can first 
fix some standard enumeration of the objects, establishing a one-to-one correspondence 
between the objects and strings; then we can define the complexity of an object to be the 
Kolmogorov complexity of the corresponding string. That seems to be where such an in- 
quiry usually stops, content with the notion that essentially we only need to investigate 
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Figure 1: Visual patterns. 



strings. 

However, we immediately encounter a few problems. 

First, for a class of objects (such as subsets of a Euclidean space) that has a larger 
cardinality than the set of all strings, we cannot encode all objects by strings; thus we 
need to give up the one-to-one correspondence. We must either encode only some of the 
objects, encode (perhaps an infinite number of) multiple objects by each string, or employ 
some combination of the two approaches. The choice amounts to knowing what to ignore, 
whether it is some (even most) of the objects that are not encoded, or the difference between 
the objects that are encoded into the same string. How should we make this choice? 

More fundamentally, the resulting measure has little meaning without actually specify- 
ing the encoding. Let O and S be the sets of objects and strings, respectively, and K(s) the 
Kolmogorov complexity of a string s. With an encoding / : O — > S , we might call K(f(x)) 
the complexity of object x. However, if we do not have some good reason to take a partic- 
ular /, we can equally use the encoding p o / with an arbitrary permutation p : S — * S . 
This observation renders the definition meaningless without / explicitly specified. So the 
question is: what is the encoding that gives the complexity some meaning? How can we 
avoid falling into this trap of arbitrariness? It is certainly not enough just to say that it can 
be encoded. 

With strings, we can choose the identity map as /, which gives K(f(x)) as much mean- 
ing as K(s). In other cases, however, we need to specify /, with at least some justification. 
If we insist encoding objects into strings, we need to define a concrete encoding for each 
class of objects. 

Another problem of measuring the information solely through Kolmogorov complexity 
is that we cannot easily ignore the part of information we do not care. For instance, we 
may try to represent a point in the Euclidean plane by identifying the space with K 2 , i.e., 
by a pair of real coordinates, and then encoding them by strings. However, a single real 
number can contain an arbitrarily large amount of information. Thus, in this representation, 
a single point can have an arbitrarily large information when encoded by a string. That is 
certainly not what we want here. Thus, an important part of the encoding is specifying the 
part of the information we wish to ignore. But we cannot simply delete such information 



3 



in the encoding process, since it may be needed to identify and measure the regularities in 
the structure later. If we insist that the computation be carried out strictly in the domain of 
strings, /'s output must contain all the information in the points. But after the information 
has been converted into strings, how do we specify which part of the information should 
be ignored? 

It would be much better if we can define the notion of computation, such as compres- 
sion and pattern finding, directly in terms of the objects we deal with. What we offer in this 
paper is a meta definition of / for multiple classes of objects by specifying how to embed 
computations in larger spaces. Central to the formalism is the representation of objects that 
offers the means to specify the information in individual elements that should be ignored, 
while using that very information to find the structures, in which we wish to measure the 
amount of information. 

Thus, the central problem is that of encoding, or representation, of objects. The paradigm 
to measure information through computation is the same; the difference is where the com- 
putation takes place. This question of encoding seems to have suffered a neglect which, in 
our belief, has prevented us from formulating a notion of information in objects that have 
not already been encoded in a convenient way. We discuss this further in Section 7. 

1.2 Motivation 

Our motivation for asking this question stems from the desire to model perception. For 
perception, there needs to be a large amount of prior knowledge stored in the perceiver, 
because perception is an inherently ill-posed problem. Perception is a process in which 
the configuration of the signal source is recovered from a signal, as in recovering a three- 
dimensional scene from an image. 

The problem is that, given the signal, there are usually infinitely many possible source 
configurations. Without a preference of possible source configuration on the side of the 
perceiver, there is no reason to choose one possibility over another. For instance, our 
visual system has a great capability to organize the visual signal into interpretable shapes, 
like making sense of the famous Dalmatian photo by R. C. James in [5]. To model such a 
system, it is not enough to know what the possible configurations of the signal source are; 
we need to know in advance how likely we are going to encounter each of them. 

However, even putting aside the problem of estimating the probabilities, just storing and 
retrieving the data is impossible unless we have a very good way to compress the data; for 
instance, if we store the possible shape of surfaces as an array of 10 possible heights at each 
of 10 x 10 positions, the number of possible surfaces would be 10 100 . The way this problem 
has been dealt with is by estimating the probabilities by looking at specific characteristics 
of the possible surface. For instance, the surface smoothness can be computed from a given 
description of the surface; we can then decide, for instance, that the smoother the surface 
is, the higher the probability. Indeed, the area of computer vision and pattern recognition 
is full of such heuristics. Even when machine learning techniques are used, the variables to 
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be learned must be carefully chosen because we cannot simply learn all possible surfaces. 

Our desire to have a measure of information originates from the wish to have a principle 
for automatically deciding which quantity to look at and which combinations of variables 
to learn, because we consider it reasonable for a perceiving entity to look first for simpler 
patterns in the signal, as well as because of the demand of storage efficiency. That is, if we 
have a measure of simplicity of general visual objects, we can say, for instance, that the 
probability is proportional to the simplicity, or use machine learning techniques to learn 
the probabilities that such simpler patterns appear. 

In more general terms, this is a problem of inductive inference and modeling: we in- 
ductively seek a model of the world that best explains the data. There are theories that treat 
such a problem, and among them are ones with the spirit we describe above. For instance, 
the Minimum Description Length (MDL) principled 1] advocates Occam's Razor. Among 
models that equally fit the data, it chooses the one that is the "simplest" in the sense that 
it allows for a shorter description of the data. However, crucially missing from this the- 
ory is the problem of representation. The MDL theory only deals with strings as the data 
and does not say how the objects should be described by strings. It is a good principle 
for people trying to deal with individual problems they understand; but when it comes to 
dealing with general objects, it lacks the mathematical concreteness needed to program the 
principle itself into machines. 

Also, as perceiving entities, we seem to have more interest in the finite part of the data. 
One may even say that we can only perceive the finite information out of any infinitely rich 
source of information, on the basis that our capacity of representation is presumably finite. 
For instance, if we see a white noise image, we do not perceive the amount of information 
that can be encoded in such an image. Instead, we glean the information that we can; we 
might just note that it is a white noise, or if it is a video we would recognize that the noise 
is constantly changing, and so on. If we see the three images in Figure 2, which are the 
same pattern with different noise added, we do not discriminate among them. Even though 
as raw bitmaps they are quite different, we perceive almost nothing about the noise except 
for its presence; we just recognize the pattern of the lines as the same and notice that there 
are some noise. Thus, to model the perception, we need a way to recognize the part of an 
infinite signal that represents finite but useful information. This is why we are especially 
interested in inherently finite structures whose literal manifestations are infinite. 

The human visual system seems to have "the ability to impose organization on sen- 
sory data — to discover regularity, coherence, continuity, etc., on many levels," which is 
"apart from both the perception of tri-dimensionality and from the recognition of famil- 
iar objects [15]." We agree that such structure and organization that appears at every level 
is the key to modeling vision and perception in general. One purpose of this work is to 
provide a language to express the perceptual organization that enables us to implement the 
ability to impose it on the data. 
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Figure 2: Visual patterns with noise. 



1.3 Desiderata 

What we seek is a description, or representation, of general objects with the following 
properties: 

I. General: It can represent a general class of objects and all objects in the class, includ- 
ing a part of the descriptions themselves, allowing hierarchical description. 

II. Uniform: It represents the objects in a uniform way by simple rules. 

HI. Reflect complexity: The intuitive complexity, or the amount of information in the 
object, corresponds to the complexity of the description. In particular, intuitively 
finite object has a finite description. 

IV. Grounded: There is a set of rules that applies to the whole class of objects, not de- 
pending on the instance of the object, dictating how the described objects are related 
to the raw signal. 

In the case of strings, the literal representation satisfies I and II: representing a string 
as a string is obviously general enough to represent any string, and the representation is 
uniform for any string. Fixing a universal Turing machine U, one can consider a program 
p for U as a description of a string x if p causes the machine to halt after writing out 
the string x on the tape. Intuitively simple string would have a shorter program. Also, 
describing strings by other strings automatically satisfy the describability of descriptions 
and the groundedness. Thus this description would satisfy all of the desiderata. 

In the case of images, we can think of a function on a rectangle in R 2 as the literal 
representation, satisfying the desiderata I and II. But we do not know of a representation 
that satisfies all of the desiderata. Perhaps the closest is the page description languages like 
PostScript, possibly modified to allow infinite precision. However, it has too many primi- 
tives to be convenient for mathematical treatment. Also, the uniformity and simplicity of 
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the rules of description is important not only for the sake of mathematical convenience, 
but also because we aim eventually to develop a way of automatically extract such descrip- 
tion from the literal description, or the signal. More crucially, PostScript is not general 
enough: its class of objects is limited to two dimensional pages. The applicability to more 
general objects than just images is important because we would like to find a description 
that reflects the structure within more abstract data than a two-dimensional page, especially 
the description itself. For instance, if we have a way to describe circles by center points 
and radii, we would have a three-dimensional space of circles. We would like to use the 
same uniform description to describe a group of the 3D points corresponding to the circles. 
Thus, allowing "describing the description" is crucial in order to allow efficient description 
of repetitive, hierarchical, and more general structures. 

The groundedness requirement is needed to treat general structures in a uniform way. 
When we say that some data represents some object, we implicitly assume a set of rules 
for data interpretation and manipulation. It is this set of rules that gives the structure 
to the object. It is like a machine with knobs and buttons to control it: knowing their 
settings may be enough to determine the state of the machine; but to describe the effect and 
interaction of the machine with the environment and other machines, we need more than 
the internal parameters. If the rules are ad hoc, varying from one instance of representation 
to another, it would be impossible to formulate the notion of general structures and describe 
the manipulation of and interaction between such structures. 

All the desiderata are related to each other. In particular, we emphasize the following: 
it is not enough that the simple objects correspond to less data (III); the correspondence 
must be obtainable from the representation (IV) in a uniform (II) way that applies to the 
whole general class (I) of objects. For instance, we can say a pair of a point and a real 
number represents a circle by regarding them as the center and the radius. Or we can 
say that the pair represents a line by regarding it as a point the line goes through and its 
angular direction. But for the representation to cover both cases, one must also include in 
the representation some data specifying which case it is for each object. Such data quickly 
adds up when one wants to represent various shapes; so when we say that general shapes 
are described in a representation, it has not only to cover all the shapes but also include 
the necessary data in a way any shape represented can be converted into a common, literal 
representation. 

1.4 Related Work 

The General Pattern Theory [6, 7, 8] is an effort to provide an algebraic framework for 
describing patterns as structures. It defines a vocabulary which is manipulated to cast the 
concept of pattern in a precise algebraic language. While it has detailed algebraic and 
statistical theories with many examples, we only discuss here the part that deals with the 
representation of patterns. The representation is based on graphs. A graph is fixed; each of 
its node can be assigned one of generators, whose set is predetermined; a restriction as to 



7 



which combination of generators can be assigned to the nodes is defined as a set of pairwise 
restrictions corresponding to the edges of the graph. There are numerous examples in the 
literature showing that this representation can be used to represent many classes of objects. 

The Syntactic Pattern Recognition[4] also represents patterns in a way that explicitly 
handles the interrelationships between the parts that make up the whole of the object, and 
use the explicit structure in patterns to recognize them. The representation is either by a 
formal language or a graph. 

Neither of the representations used in the two formalisms is satisfactory for us. The 
crucial problem is that they are not grounded in the sense above. They are not uniform 
from a class of objects to another; thus, although we can talk about the information in 
objects in each class, there is no way to compare them across the classes. They are general 
in the sense one can adopt them to many different classes of objects, but they are not 
general enough to represent all of the classes in the same uniform way. They cannot be 
used, for instance, to define what patterns are, because there is no prescribed way for the 
representation to be connected to general enough class of objects. There is no formal way 
to give a raw data and ask what pattern it might form. 

Besides the Kolmogorov complexity we already mentioned, there are many notions of 
the complexity of objects. Most of them are concerned about the complexity of objects that 
do not include what we deal with in this paper. Also, note that what we define in this paper 
is a measure of information like Kolmogorov complexity and Shannon information [12], 
rather than a measure of complexity such as the computational complexity. We refer the 
reader to the appendix of [3] for an overview of formulations of complexity with an exten- 
sive bibliography. 

1.5 Overview of the Representation 

In this paper, we introduce a representation that fulfills all the desiderata above. Here, we 
give an overview of its definition and some of its properties. 

We assume that the objects are given a priori as subsets of some sets. Those that can 
be thought of in this way forms a very general class that seems to include most, if not all, 
objects we might deal with. For example, we can think of a binary string s = s S\ ■ ■ ■ s n as 
a subset {(/, si) \ i = 0, • • • , n} of N x {0, 1 }; an image can be thought of as a subset of the 
product of the image plane and the color space, i.e., the graph of the image function on the 
image plane. A physical object like a bicycle or an automobile, at one level of abstraction, 
can be thought of as a subset of E 3 x M, where E 3 is the 3D Euclidean space and M the 
set of materials, e.g., glass, iron, rubber, etc.; the subset consists of (x, m(x)), with m(x) the 
material that occupies the point x in E 3 . We call this representation of objects as subsets 
the ground representation; it serves as a signal-level, literal representation. It is simple to 
represent something in this way; we can then ask the amount of information therein. 

The ground representation is an abstraction of the kind of data representation that we 
call the dense representation, which includes strings, bitmaps, and other raw data. It cor- 
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responds to representing a string as itself. One property of dense representation is that the 
presence of regularities does not affect it. For instance, any image can be represented as a 
bitmap in exactly the same manner, whether it is a regular image or a white noise. Another 
type of representation, which we call the sparse representation, utilizes regularities in the 
object to describe it. In the image example, if it is an image of geometric objects, it should 
take very little data to describe it, at least in principle; a circle, for instance, can be repre- 
sented just by specifying its center point and radius. The same kind of description cannot 
be used to describe a white noise. 

An important feature of the representation proposed in this paper is that it can interpo- 
late between the dense and sparse representation, so that it can take advantage of regularity 
in the data while also being capable of representing any, even random, data. This is similar 
to using an input to a Turing machine to represent strings. 

As the vocabulary to describe the regularities in such objects, we use the maps that 
characterize the space in which the objects are included as subsets. Maps characterize the 
structure of spaces in the following sense. Any two sets with the same cardinality are the 
same sets in absence of other characteristics. For instance, R is the same set in this sense 
as K 2 , i.e., there exists a one-to-one map between them, if we disregard the structures such 
as the topology, the vector space structure, the metric structure, the order, and the algebraic 
structure. These structures can be characterized by maps. For instance, the metric structure 
is defined by the distance map that gives the distance between two elements of the set; the 
order is given by a predicate on a pair of elements that returns true if the first element is 
less than the second. The two sets are different when we consider the structures because 
the one-to-one map does not commute with the maps that define the structures. 

Using such structure maps to describe regularity, objects with regularities can be repre- 
sented through sparse parameters in the proposed representation. For a given object, there 
can be many different ways of representing it, just as there can be any number of inputs 
to a Turing machine producing the same string. Importantly, there is a prescribed way to 
connect the description to the ground representation; thus, the representation is grounded. 
While taking structures into account so that regular objects can be represented as such, it 
automatically provides an interpretation of each represented object into the signal level. In 
other words, the relationship between the parameters and the data is part of the represen- 
tation. Thus, we can give our data in the ground representation and then ask what sparser, 
more structured representation is possible. 

Let us be slightly more concrete. In the proposed representation, we take a number 
of sets and maps between them (that are to be composed by the structure maps), which 
we call a diagram. Then we call an assignment of a subset to each set in the diagram its 
cross section. A cross section must satisfy a certain constraints, because of which we can 
uniquely determine all the subsets by specifying only a partial cross section, which assigns 
subsets to only some of the sets in the diagram. If one of the subsets coincides with the 
ground representation of the object in question, we say that it is represented by the diagram 
and the partial cross section. 
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The maps define the structures we take into consideration, which determine the reg- 
ularities, which in turn allow more concise description of the object than the literal one. 
Since the representation by diagrams and cross sections is explicitly in terms of the maps 
in the diagram, it is apparent from the diagram exactly what structure is taken into account. 

Some of the properties of the representation are as follows. It is: 

i) Sparse: Unlike dense representations such as bitmap, it is capable of representing 
objects by a combination of their essential structure and instance-specific parameters. 
The diagram expresses the essential structure while the partial cross section represents 
the parameters. Implementation-dependent approximation of representation only af- 
fects the parameters and thus can be separated from the structure. The sparseness 
also makes it flexible and easy to manipulate. By modifying the parameters, different 
instances of the same structure can be easily represented. Also, comparison of two 
patterns having the same structure is naturally defined. 

ii) Modular: Parts of the representation can be understood as the modules to construct 
larger and more complex ones. Complex combinations can be obtained hierarchically 
and recursively as well as by simple union and intersection. 

iii) Hierarchical: Because it can be applied to any data, it can also be applied to the param- 
eter space parametrizing some other structure, leading to a hierarchical representation. 

iv) Recursive: It can represent a recursively defined structure, making it particularly pow- 
erful in, for instance, representing repeated patterns. The "repeat" can be in various 
spaces that can manifest in the final pattern in non-obvious ways. 

Finally, diagrams and their cross sections can represent maps between powersets. In 
fact, the representation of subsets can be considered a special case where the map sends a 
trivial set to the subset. Any computation, in particular, can also be represented. 

1.6 Measure of Structural Information 

Using the representation, we introduce a measure of information. Roughly speaking, it is 
defined as the size of the smallest diagram representing the object, where the diagram can 
contain only those maps that can be composed by a set of given structure maps, including 
constant maps. 

Thus, the measure is relative to the structure and constants expressed explicitly in the 
form of maps. The explicit incorporation of the structure of the object space is the key to 
avoiding the trap of arbitrariness. The patterns such as shown in Figure 1 all have finite 
information according to the measure. The measure is relative to the constants because of 
the aforementioned need to separate the information in the structure from that in infinite 
objects such as real numbers. 

Because of the reasons laid out in 1.1, we do not follow the recipe of interfacing Kol- 
mogorov complexity by encoding objects by strings; strings are given no special status in 
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this theory. Instead, we define it directly in terms of the structure of the spaces in which 
the objects reside. As such, the new measure does not depend on Kolmogorov complexity. 
Since the class of applicable objects includes that of strings, however, it raises a question of 
their relationship. It turns out that the new measure is equivalent to Kolmogorov complex- 
ity in the case where strings are characterized by the structure of natural numbers given by 
the constant and the successor function. Note that this is not obvious a priori: the defi- 
nition of the representation and the measure does not even mention strings. Also, the new 
measure is defined relative to structure maps. It is equivalent to Kolmogorov complexity 
when it is defined relative to this particular set of structure maps; relative to other sets, it 
may not be. If the set includes the constant maps of all strings, for example, any string's 
information would be 1 . 

Thus, the new measure gives the larger class of objects a meaningful measure of infor- 
mation that generalizes Kolmogorov complexity. 

The rest of the paper is organized as follows. In the next section, we define the notion of 
diagrams and their cross sections precisely, as well as what is meant by representing with 
them. We also list the notations used throughout this paper. In section 3, we illustrate some 
properties of the representation by geometric examples. In section 4, we give more exam- 
ples, this time those representing computations. In section 5, we define the information 
measure of structure of general objects. In section 6, we prove that the measure general- 
izes Kolmogorov complexity. In section 7, we further discuss the difference between our 
approach and the string-centered one, before concluding. 

2 Representation by Diagrams and Cross Sections 

2.1 Definitions 

We fix the notation for standard finite sets as 1 = {0}, 2 = {0, 1}, • • • , n = {0, 1, • • • ,n— 1}, 
etc. The set 2 is also used as the set of Boolean values, meaning false and 1 true. We 
mean by / : X — > Y that / is a map from X to Y. We denote the set of all subsets of X (the 
power set of X) by 2 X . The map from 1 to X that maps e 1 to i e X is denoted by the 
same letter x : 1 — * X. We call it a constant map. 

Definition 1. Let = (5,), eJ r be a family of sets indexed by a set J* . A cross section s of 
5? is an assignment to each set 5, -in 5? of its subset s\ c S,-. 

In other words, a cross section of y is another family (s,),- e ^ of sets indexed by y such 
that Si c Si for all i e J* . We used the index set to make clear that there can be multiple 
members of the family that are identical as sets; however, we avoid the use of indices 
almost entirely in this paper. We use the set-theoretic notation with such as S e . 
The equality of two members of 5? means that their indices are the same; if the indices are 
different, we treat them as different, even if they are identical as sets. When we discuss a 
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set S = S , in y and a cross section s of y, s(S) denotes the subset S; assigned to 5, by s. 
Thus, s assigns each S e y its subset s(S) c 5 . 

We denote the set of cross sections of y by r(,y). Let ^ be a subfamily of y. A 
cross section of 2? is called a partial cross section of y . For a cross section 5 of y , the 
cross section of 2? that assigns 5 , (5) to S in ^ is called the restriction of s to 3*, denoted 
by s\j. For a cross section f of 3*, we denote the set {s e r(y) \ s\ : y = t} of cross sections 
of y that restrict to t by T(y \ t). 

Definition 2. A diagram is a triple (y,y',J%) of a family ^ = (5 ftu-J of sets, its 
subfamily y , and a family ^# = {ipj)j e j of maps of the form (pj : 2 5 — > 2 r , with 
S,T € y, where ^ are index sets. 

A diagram (,y, y, such that both J? 7 and jft are finite is called a finite diagram. 
Let (y, y, be a diagram. There are maps dm : jft — > J? 7 and cdm : ^# — > J? 7 such 
that 95 : 2 dra(v) -> 2 cdm(v) for <p e Jt . Also, define the maps out = dm' 1 : y -> 1 M 
and in = cdm -1 : y -> 2 M so that out(S) = {cp e ^\dm(<p) = S} and in(5) = {(p e 
Jt | cdm(^) = S } for S e y. 

Definition 3. The cross section of a diagram (y, y, Jt) is a cross section s of y such 
that, for any S e y with in(S') + 0, the following holds: 

s(S) = P| <p(s(dm(<p))) if S e,y\y\ (1) 

V ein(S) 

s(S)= [J <p(s(dm((p))) if S (= y . (2) 

vein(S') 

In diagram (y,y,^f), the subfamily y of J? 7 specifies the sets for which a cross 
section should satisfy (2) instead of (1); this means that the cross section on that set 
should be the union, rather than the intersection, of the images by the incoming maps. 
We denote the set of cross sections of diagram (y, y, ^) by r(y, y, ^#). We also 
define r(y,y,JZ\t) = r(y, y, JT) n T{y\t) for & c y and t e r(3T); i.e., 
r(y, y, ^# 1 t) is the set of cross sections of diagram {y,y,yZ) that restrict to the 
cross section t of subfamily 3* of y. 

To illustrate the definitions by example, suppose y = {1,X, Y,Z, W}, y = {W}, w e 
W, and M = [w, cp, if/, rj, 6, k} with 

w : 2 1 -» 2 W , ^ : 2 y -» 2* </r : 2 Z -» 2 r , 
77 : 2 W -> 2 y , 5 : 2 W -> 2* k : 2* -» 2 W , 

where w is an element of W; the same letter denotes a constant map. We denote the diagram 
(y,y,^£) as follows: 

x m " Y (2) - " Z (3) 



(4) 
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For instance, ip maps each subset of Y to a subset of X. We omit the set 1 from the diagram: 
a constant map is shown as an incoming arrow, without the domain 1. Note also that 
the arrows have dotted shafts, which signifies that the map is between power sets. The 
parenthesized subscript numbers are for reference: as more than one sets in the family 
can be identical as sets, we use these to refer to them. We always use S, to mean the set 
with the subscript (i) in the diagram under discussion. Thus, if we are discussing the one 
in equation (3), Si means the set X and S 2 means Y, etc. Also, there are two kinds of 
arrowheads: the ordinary arrows and round ones. An arrow has the round arrowhead if and 
only if it is coming into a set in y, in this case S4 = W. For a cross section s of this 
diagram, we have, e.g., 

in(X) = {<p,5}, dm((p) = Y, s(X) = (p(s(Y)) n 6(s(W)), s(W) = k(s(X)) U {w}. 

A diagram and its partial cross section represents an object in the following sense. 

Definition 4. Let (y, y, jM*) be a diagram, X a set in y , 2? a subfamily of y , and t 
a cross section of . Suppose an object is represented in the ground representation as a 
subset A of X. The object is said to be represented by (y, y , 3 ', t, X) if s(X) = A for 
every cross section s in (y, y , <M \ t). 

The ground representation is a special case of this as a trivial representation; just take 
y = y = 2? = {X}, j$ = 0, and t(X) = A. Thus the representation is general enough to 
include all dense representation. The aim, however, is to enable more efficient representa- 
tion that captures the structure. 

Here, we also define the concepts of minimality and limit for later use. 

Definition 5. Let {y,,9",Ji) be a diagram, X e y, and G c r(,y,y',Ji). A cross 
section s e G such that no other t e G gives t(X) c s(X) is said to be minimal on X in G. 
We denote the set of such cross sections by min x G. 

Note that miny min^ G D mm x G n miny G for X, Y e y-. if s e mm x G n miny G, then 
s(Y) c t(Y) for any t e min x G since t e G and s e miny G; thus s e miny min x G. Since 
it is also the case that miny min x G c min x G, by symmetry it follows that miny min x G n 
minx miny G = min^ G n miny G. 

Definition 6. Let (y, y, J£) be a diagram, X e y, 5* c y, and t e r(T). Furthermore, 
let X\, • • • , X n be a finite number of sets in y. A subset A of X is said to be represented by 
the data (y, y, 3? ', t, X, (X\ , • • • , X n )) as a limit if s{X) = A for any cross section s in 
min Xn min Xn _, • • • min X| y , J{ \ t). 

2.2 Notations 

Here we list some more notations used in this paper. 
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i) For any set X, id : X — > X denotes the identity map on X and to : X — > 1 the unique 
map from X to 1 = {0}. The complement map cmpl : 2 X — > 2 X is defined for A c X 
by: 

cmpl(A) = C A =X\A. (4) 

ii) The product map /i X • • • X f n : X — > Yi x • • • X Y n of maps $ : X — » Y f (i = 1, • • • ,n) 
is defined by (fi X • • • X /„)(x) = (/i(x), • ■ • > fn(x))- Given a map / : X — » Y and a 
constant map z : 1 — > Z, one can construct a product map: 

/x(zo W ):I^FxZ 

of / and z°co : X — » Z. By abuse of notation, we denote this map by /xz : X — > 7 xZ. 
Similarly, we mix maps of the form X — > Y and 1 — > Z freely in making a product 
map. 

iii) For a Cartesian product X { x X 2 x ■ ■ ■ x X n , the map 

n t : X Y x X 2 x • ■ • x X n ^> Xj 

is the projection to the t'th component. We use a shorthand tt^ for the product map 

7T, x nj : Xi x • • • x X n — > X x X;, Tryjt for nj x x 7tk, and so on. 

iv) For a disjoint union Xi + X 2 + • • • + X n , the map 

ti : Xi Xj + X 2 + • • • + X„ 
is the injection from the f fh component. 

v) The map union f + g : X + Y — > Z of maps / : X — » Z and g : y — » Z is defined by 
(/ + g)W = fix) if x 6 X and (/ + g)(x) = g(x) ifx&Y. 

vi) For a map / : X — > y, we denote by the same letter the map / : 2 X — > 2 F between the 
power sets defined by /(A) = {/(x) | x e A} c y for A c X. 

vii) For a map / : X -> Y, the map : 2 y -» 2 X is defined by f~\A) = {x e X | /(x) e 
A} c X for A c y. By a slight abuse of notation, by f~ l (y) for y e Y we mean / _1 ({y}). 

viii) For a map / : X —> X, /° denotes id x . For a positive integer n, f" denotes the map 
/" : X — > X defined as applying / for n times as well as the map f n :2 X ^>2 X defined 
as in vi). When n is a negative integer, f" denotes the map f" : 2 X — > 2 X defined as 
applying f~ l : 2 X — > 2 X for -n times. 

3 Geometric Patterns 

Using diagrams and cross sections, we can represent geometric objects in a uniform and 
compact way. In this section, we introduce the representation and discuss its properties 
using examples. 
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3.1 Examples 

As the simplest example, we consider a circle in the Euclidean plane X. Let us denote the 
vector space of translations in X by V. Also, denote the map that sends (x, y) e X x X to 
x - y by sub : X x X — > V, and the map that gives the length of a vector by len : V — » R. 
Consider the following diagram: 



- V (2) 



(5) 



XtT.'i ■>■ A^ X X(A\ ^Xf' 



! sub" 1 

\ 

This denotes a diagram (^, with 

^ = (Si,--- ,S 5 ), ^' = 0, Si =R, S 2 = V, S 3 = S 5 =X, S 4 = XxX 

and 

^ = (len" 1 : 2 S> -» 2 S2 ,sub~ 1 : 2 Sl -» 2 s \n 2 ' 1 : 2 5a -> 2 s \;ri : 2* 4 -> 2 5s ). 

Note that, while the inverse maps are indicated by _1 , the power map in the forward 
direction is to be surmised from the convention that the map is between powersets. 
Suppose that & = {S i, S3} and that its cross section t is defined by 

f(Si) = {r}, t(S 3 ) = {p}, 

where r is a positive real number and p is a point in the Euclidean plane X. Let s be a cross 
section in r(y, y", J% \ t). Then, by (1), we have 

s(S 2 ) = len _1 (5(Si)) = len _1 (?(Si)) = len _1 ({r}) = {veV\ len(v) = #■}, 
s(S 4 ) = sub" 1 ( S (S 2 ))nn 2 - l ( S (S 3 )) 

= {(x,y) g X x X I x - y e s(5 2 ),y e s(S 3 )}, 
S (S 5 ) = {7ri((x,y))eX|(x,y)e j(S 4 )} 

= {ieXU-yei(5 2 )jei(5 3 )) 

= {jc 6 X I len(jc - p) = r}. 

Thus the cross section s is completely determined and s(Ss) is the set of the points on the 
circle centered at p with radius r. In this way, (S fi , y, 3?, t,S 5 ) represents the circle. 

If KS3) = {p, q], it represents two circles with the same radius r centered at p and q. 
Thus, we can think of S3 as the space of centers of the circles. If t(S 1) = {r, t} instead, it 
would represent two concentric circles with radii r and t. If we modify the diagram to 

Tow v ((leno^Jx^r 1 T/ v (subxa-2)" 1 v ^ V Kl V 
K X A(i) > V X A(2) >■ A X A(3) s- A (4) 
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and let 2? = {Si} and define t £ T{5") by t(S\) = {(r^pi), (r 2 ,p2), •••}, then we have 

s(S 2 ) = {(v, x)eVxX\ (len(v), x) £ 

5(5 3 ) = {(*,?) £XxX\ (x-y,y) £ 5(5 2 )}, 

S (5 4 ) = |igI|^€X, (len(jc - e ?(5 0}, 

and we have as 5(54) the circles specified by the radius-center pairs in t(S 1). 

For another example, a line in X can be represented using the following diagram: 

V (1) .llL V x R (2) 

; sub -1 o mult (6) 
n -1 Y ff 

^(3) % x -^(4) ■>■ -^(5) 

Here, mult : V x R — > V is the scalar multiplication. Other maps are as above. Suppose 
that 2? = {5 1, 5 3} and that its cross section t is defined by 

KSi) = {v), t(s 3 ) = { P }, 

where p is a point in the Euclidean plane X and v is a vector in V. Let 5 be a cross section 
in r(Y, ,9", Jt \ t). Then, from (1) we have 

5(5 2 ) = Trr^M) = {(v, c) e V x R | c e R}, 
5(5 4 ) = sub~ 1 (mult(5(5 2 ))) n7r 2 _1 ({/?}) 

= sub _1 ({cv e V I c e R}) n n 2 ~\{p}) 

= {(x, p) £ X X X \ 3 c £ R, x - p = cv}, 
5(5 5 ) = {jc € X I 3 c e R, x - p = cv} 

= {p + cv I C € R}. 

Thus, the cross section 5 is completely determined and 5(5 5) consists of the points on the 
line that goes through p and has the direction parallel to v. 



3.2 Union 

As mentioned in 2.1, 



denotes the case when (1) in Definition 3 is required, i.e., 5 2 £ \ 5?' . Any cross section 
5 of the diagram satisfies s(S 2 ) = 0(5(5 1)) n 1^(5(53)). To denote the other case, we use 



5, Us 2 c^ 5 3 (7) 



16 



to indicate that 5 2 e <9" and s(S 2 ) = (p(s(S i))Uif/(s(S 3)). Thus, for any set in y, incoming 
maps are depicted with the round arrow. 

In the examples, we may use two kinds of incoming arrows as: 



It means s(S 2 ) = ((p(s(Si)) U i/f(5(S 3 )) U 9(s(S 4))) n 77(5(55)), i.e., we take the unions first, 
and then the intersection. This is simply an abbreviation of 



Si 



>5 6 €■■ 



id 



S 4 -V 2 - 5 s 

If we allow the complement map cmpl : 2 X — » 2 X in diagrams, we need only one of the 
conditions in Definition 3, because we can make unions from intersections or vice versa. 
Using cmpl, 

cmpl o0 cmpl 01ft 



Si 



■ 5 4 ->■ 
I cmpl 

5 2 



(8) 



5(5 4 ) = >(5(5 1 ))n>(5(5 3 )) 

5(5 2 ) = c 5(5 4 ) = 0(5(5 0) U ^(5(5 3 )). 
Thus, (8) is equivalent to (7). 



3.3 Representing maps 

A diagram with partial cross section can represent a map in the following sense: 

Definition 7. A map <p : 2 X -» 2 F is said to be represented by (^, S? , t, X, Y) if 

(J^, S?' , ^) is a diagram, 2? c f e r(^), X,Y e y, and every cross section 5 in 
r(y, y, Jt \ t) satisfies s(Y) = <p(s(X)). 

As an example, let us represent the map cl : 2 X — » 2 X that maps a subset A of a Eu- 
clidean space X to the topological closure A of A in X. Consider the diagram (y, J?", 

X m n, ' ] - X x A',,, ™ b " 2 - V X Z (3) ... (leno .." l)>or . 2 > R x X {4) > R X X (5) 

R(6) ^(7) 
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Here, the map Inf maps B c R x X to {(a, x) G R x X \ x G n^CB), a = inf n n 2 ' 1 (x))}; 
thus, for each x e X that appears in 5, there is an element (a, x) in Inf (5), where a is the 
infinimum of the set of real numbers b that appear as {b, x) in B. Now, if & = {S^}, t(S^) = 
{0}, s 6 T(J^, J^', JC | ^), and 5(5 = A, we have 



s(S 2 ) 
s(S 4 ) 



{(x,y) | xeA,y el} 
{(J, y) | y 6 X, 3 x e A, len(x - y) = d) 
{yeX\ inflen(x-y) = 0} = A. 



xeA 



Thus (^, M,Sr,t,S x ,S n ) represents cl. 

The infinimum map Inf in turn can be represented by 



XX 



(1) 



xRx! 



^2X(lto^[2)Xff3 



(2) 



Cmpl O 7T13 



x2xX (3) >RxZ 



"2 



-(5) 



(4) 
Max 

xX (6) 



with & = {5 5 },£(5 5 ) = {1}. The map It : R x R 2 maps (a,b) to 1 if a < b and 
otherwise, while the map Max maps B c R x X to {(a, x) G R x X | x G n 2 {B), a = 
max tt^B n 7r 2 _1 (x))}. Then if s e r(S? , 5?', Jt \ ST) and s(S ,) = S we have 

5(5 2 ) = {(a, Z?, x) | (a, x) e B,b e R}, 
5(5 3 ) = |(KU)| 3 (fl^)efi,«<M, 
5(5 4 ) = {(b, x) I a (a, x)eB,a< b}, 

s(S 6 ) = {(c, x) \ x e n 2 (B), c = max{b G R | ^(a, x) e B,a < b}} 
= {(c, jc) I x G 7r 2 (B), c = inf(fi n 7r 2 _1 (x))}. 

Thus (^, y, M,ST,t,Sx,Sd represents Inf. 

Finally, the maximum map Max can be represented by 



7T13 



(1) 



xRxX, 



7T\ X(lt o ff[2>Xff3 



A 

: Cmpl O 7T13 



(3) 



x2xl 



(4) 



"1 



-(5) 



with ST = {5 5 }, ?(5 5 ) = {1}. Then if 5 G r(3?, and 5(5 1) = B we have 

5(5 3) = {(a, Z?, x) I (a, x), (Z?, jc) g B), 

5(5 4) = {(a, l,x) I (a,x) G B, 3 (b,x) £ B,a < b}, 

5(5 2 ) = {(a, z)efl| $ (b, x)eB,a< b). 



Thus (^, y,^,£?,t,S U S 2 ) represents Max. 
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3.4 Recursive Definition 

Consider the following diagram: 

tt,-' I | add (9) 

id ; ° 
^(3) ' 3 

Here, add : X x V — > X is the parallel translation (jc, w) h-> x + w in the Euclidean space X. 
Suppose that & = {5 i , 5 3 } and that its cross section t is defined by 

fCSi) = {v}, ?(5 3 ) = {/>}, (10) 

where p is a point in the Euclidean plane X and v is a vector in V. Let 5 be a cross section 
in r(y, y, J( \ t). Then, from (1) we have 

s(S 2 ) = {(x,w) | x e 5(5 4 ),w e s(5i)}, 
s(5 4 ) = {/?}U{x + w| (x,w) e s{S 2 )} 

= {p} U {x + w \ x e s(S 4 ), w e s(S i)} (11) 

From (11), clearly 5(5 4 ) D D = {p, p + v, p + 2v, p + 3v, ■ ■ ■}, i.e., 5(5 4 ) contains 
equally spaced points beginning at p and separated by v. However, this does not uniquely 
determine the cross section: for instance, we can define s(S 4 ) = X; or indeed any set that 
is the union of D and a set invariant under the translation by v. 

To make it unique, we can take min S4 T{5^ , , <M \ t). Then it only contains the cross 
section with s(S 4 ) = D. 

Or we can use the following proposition. Let N = {0, 1, 2, 3, • • • } denote the set of 
natural numbers. 

Proposition 1. Suppose that a set S has a "grading" function g : 5 — > N and let 5 „ denote 
g (n)for h£N. Consider a map rj : 2 s — > 2 s that satisfies, for i G N, 

oo 

7/(50 cS i+u r](S) = (J 77(5 „). 

n=0 

IfS can be written 5 = So U 7/(5), ?/zerc 

DO 

5 = U*7 n (5 ). 

Proof. Since 5„ 5 OT if n ± m, for x 6 S n+ i with n G N, x <t 5 and x <£ n(S m ) c S m+ \ if 
m ^ n. Thus jc g 7/(5,,) follows from 

oo 

5 = 5 U 77(5) = 5 u|J 77(5 „). 

n=0 
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Thus5„+i c rj(S n ). Since rj(S n ) c 5„ + i, it follows r/(S „) = S n+ \. Therefore, S n = ?7(SVi) = 
rj(r](S n -2)) = ■•■ = if (So). The proposition follows from 

CO 

S = \Js„. 

□ 

To use Proposition 1, we modify (9) as: 

, 13 -4k (12) 
XxN (3) id ?XxN (4 ) -X,,, 

and define ^:IxVxN^IxNby ip({x, w,k)) = (x + w, k + 1) as well as modifying 
(10) to f(Si) = {v}, t(S 3 ) = {(p,0)}. Then (11) becomes 

5(5 4 ) = s(S 3 )U(p(s(S 2 )) 

= {(p, 0)} U {(x + w, k + 1) I (x, k) G 5(5 4 ), w e 5(5 1)}. (13) 

We define g : 5(5 4 ) -» N by g((x, k)) = k and 77 : 2 v(S4) 2' 5(54) by 

77(A) = {(jc + w, k + 1) I (x, k) G A, w G 5(5 1)}. 

Then g and 77 clearly satisfy the condition of Proposition 1. Thus it follows from (13) and 
the proposition that 

00 

s(S 4 ) = {Ji 1 ({(p,0)}) 

= {(p, 0), (p + v,l),(p + 2v, 2), (p + 3v, 3), • • • }. 

Thus r(y,y',^\t) contains only this cross section 5 with 5(5 5) = D, and D is 
represented by (y, 5?', Jt , ST, t,S 5 ). 
If we set t(S 1) = {v, -v}, then 

s(S 4 ) = {(p, 0)} Li{(x + v,k+l)\ (x, k) G 5(5 4 )} U{(jc- v,* + 1) I (jt, /c) G 5(5 4)} 

and thus 

5(5 5 ) = {••• ,p-3v,p-2v,p-v,/?,p + v,p + 2v,p + 3v, •••}. 

Moreover, if we set t(S 1) = {v, -v, w, -w}, then in general we get a grid points as shown in 
Figure 3(a). 
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Figure 3: (a) Recursively defined points, (b) Sierpinski triangle, (c) Hierarchically defined 
grid of circles, (d) Hierarchically defined lines. 





3.4.1 Sierpinski triangle 

Consider the diagram (J?, , ^#): 



X 



(i) 



1"! 



■■>■ X x X ( 2) ■ - V x 

A 



"2 
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- ! W<4) 
sub -1 o mult 
X X X(f.\ ■ >■ Xr 



Here, let ^ = {5 4 ,5 5 } and f(5 4 ) = {0.5}, f(5 5 ) = {/?}. Then for 5 e r(S fi ,S fi ',JK\ t), 

5(5 2 ) = {(*,/>) I x e s(5i)} 

5(5 3 ) = {(x- j p,0.5)U6 5(5 1 )} 

s(S 6 ) = {(y,p) I 0.5(x - j9) =y-p,x£ 5(50} 

s(5 7 ) = {p + 0.5(x -p)\xes(S 1)}. 

Let us define a map J p : X — > X for p e X by J p (jc) = p + (x - p)/2. Then, the power map 
of d p is represented by (y, y, 5 1, 5 7 ). 

For three points a, and c forming a triangle T in X, let us define a map : 2 X — » 2 X 
by <£w04) = d fl (A) U <4(A) U rf c (A) for A c X. Obviously, ^ ai)C can also be represented by 
a diagram. Then, consider the diagram 

•P'abc 



X (1) -'±., x x 



T cl o n-i v 
H2) *■ A (3) 



Here, <p', is <£y, c that increments the second component and id x maps A c X to A x {0} = 
{(x,0) I x G A} c X x N. Let ^ = {5i} and let t(Si) = T. Then by Proposition 1, 
s(S 2 ) = Ur=o^, c (T x {0}). Then 5 e T{5P,5P\Jt \ T) gives as s(S 3 ) the closure of 
U^=o *Pabc(T), which is a fractal set known as the Sierpinski triangle. See Figure 3(b). 
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X 0) (a) (b) (c) (d) 



Figure 4: A diagram to make repeated patterns with arbitrary patches, (a) A patch to be B: 
it can be any random data, (b) Regularly repeated patches, (c) A set A of random vectors, 
(d) Randomly repeated patches. 



3.5 Hierarchical Definition 

Combining (5) and (12), we consider (y, y, 



V ( i) -VxV'xH,,, 



len 



H3) 



(4) 



sub 



IxN <5) kl Xx 



"(6) 



■ ^(7) ■ ■■>■ X X -^(8) ' > ~ -^C 



(9) 



The left-half comes from (12) and produces the grid points in 5 7 , which is the space of 
center points in the right-half (5). With & = {S i,S 3 , S 5 ] and 

?(5!) = {v,-v, u,-u}, t(S 3 ) = {r}, t(S 5 ) = {(p,0)}, 

(y, y, & , t,S 9 ) represents a grid of circles as shown in Figure 3(c). 
Similarly, with the diagram (J?, y, .M): 



\/ ( i , - ■ X x V x 



«(2) 



V (3) ■■■*■■"■> V X 



T13 



K.(4) 

sub" 1 o mult 



■ X(i\ ■>■ X X Xtit\ ■>■ Xa 



X X N(5) ■ 3 X X N(6) ' >■ -A.(7) ■>- A A A(8) ■>- A(9) 

let ^ = {Si,S 3 ,S 5 }and 

KSi) = {v,-v}, ;(S 3 ) = M, f(5 5 ) = {(^,0)}. 

Then (y,y,^, £?,t,S 9 ) represents the lines that is parallel to u and go through the 
points {• • • , p - 3v, p - 2v, p - v, p, p + v, p + 2v, p + 3v, • • • }, as shown in Figure 3(d). 



3.6 Two-part Coding 

It may seem that the representation can represent only very regular and simple objects 
such as geometric shapes like circles and lines, except maybe by the trivial representation. 



22 



That is not the case. It can represent the regular part of the object sparsely and the ran- 
dom part densely, then mix them in various ways — intersection, union, hierarchically, and 
recursively — with any computation, in fact, as we will see. If there is regularity, it can be 
exploited. For example, in the diagram on the left of Figure 4, we can take any pair of sub- 
sets A c V and B c X and consider a partial cross section t such that t(S = A, ?(5 3 ) = B. 
This gives us, for instance, repeated patterns with arbitrary patches by setting A to be a reg- 
ular grid by (12) and B to be the patch (Figure 4(b), with the patch (a)). Using an arbitrary 
set A as well (c), we can also have a randomly repeated patches (d). This has still much 
more regularity than a completely random data. 

Separating the information in an object into a regular part and the remaining random 
part is the basic idea behind many formalisms such as Kolmogorov minimum sufficient 
statistic[10] and minimum description length (MDL)[1 1]. Our representation can also uti- 
lize this basic idea, which is how simple objects can be represented more concisely than 
random objects. 

The representation of lines and circles by specifying points and vectors is simple; With 
diagrams and cross sections, such simple representation and more complex ones involving 
any computation can be uniformly embedded in the spaces that the objects reside. We 
have shown a glimpse of such integration in the grid example and the Sierpinski triangle 
example. In the next section, we discuss further the representation of computation by 
diagrams and their cross sections. 

4 Computation 

With diagrams and their cross sections, we can represent computation. In this section, we 
first examine a few examples of explicit representation of concrete computation. We then 
show that any Turing machine can be represented by diagrams and cross sections. 

4.1 Examples 
4.1.1 Factorials 

Let succ : N — > N denote the successor function and mult : IxN->N the multiplication. 
Consider the following diagram y, M)\ 

NxN(i) ---J N x N(2) p ' ")v 

where (f = (n\ x mult) o ((succ o^x n 2 ) maps (n, m) to (n + l,m(n + 1)). Suppose that 
= {S i } and that its cross section t is defined by 

f(Si) = {«U)}. 

Let s be a cross section in r(,y, y', <M \ t). Then we have 

s(S 2 ) = s(S U <p(s(S 2 )) = {(0, 1)} U <p(s(S 2 )). (14) 
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We define a grading function g : N x N — » N by g((n, m)) = n. Then g and ip clearly 
satisfy the condition of Proposition 1, and it follows from (14) that 

DO 

s(S 2 ) = [J v(((0, 1)}) = {(0, 1), (1, 1), (2, 2), (3, 6), • • • , (*,/»!), • • • }. (15) 

n=0 

Therefore, r(y, y, J% \ t) only contains the cross section s defined by s(S = {(0, 1)} 
and (15); and (y, y, 3 ', t, S 2 ) represents the set of all pairs of natural number and its 
factorial. 

We can extend the diagram to 

NxN ( i) id 5NxN (2)r ")<f 

\ id 

N (3) ?."UnxN (4 ) ^' - M,s, 

Then, the (power map of the) factorial map is represented by (y, y, Jt , 3?, t,S 3 ,S 5 ). 

4.1.2 Fibonacci Number 

Consider the following diagram (y, y, ^#): 

N + x N + ( i) id 5 N + x N + (2) N + (3) 

where N + is the set of positive integers, <p = n 2 x add with add : N + x N + — > N + the 
addition. Thus (f((n, nij) = (m, n + m). Suppose that = {S i } and that its cross section t is 
defined by t(S = {(1, 1)}. Then if s e r(y, y, J?\t)we have 

s(S 2 ) = {(l,l)}U( P (s(S 2 )). 

If(n, m) e 5(5 2 )\{(1, 1)}, it must be the case that (n,m) e ip(s(S 2 )) and (m-n, n) e s(S 2 )- 
If this is not in s(S 2 ), then (2n - m,m - n) is in s(S 2 ), and so on. Since the sum of the 
two components decreases by this process, it cannot go on indefinitely and has to stop by 
reaching (1,1). Thus 

oo 

s(S 2 ) = |J <p({(l, 1)}) = {(1, 1), (1, 2), (2, 3), (3, 5), • • • } 

«=o 

and ^(53) = {1, 1, 2, 3, 5, 8, • • • }, which is the set of Fibonacci numbers. 
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4.1.3 Mandelbrot Set 



Consider the diagram ,9" , 



C 



x C x N(i) 



C 



c, 



■(3) 



A 



id 



-I 



CxCxN, 



(4) 



Here, C is the set of complex numbers. The map ip is defined by 



Suppose that 3? = {5 4, S 5} and that its cross section t is defined by 



f(S 4 ) = {(c, 0, 0) g C x C x N I c e C}, 
t(S 5 ) = {z&C\\z\>2} 



Let 5 be a cross section in r(S? , , M \ t). Then we have 

s(S = <^(s(S 1)) U 5(5 4 ) = <p(s(S 0) U {(c, 0,0) e C x C x N I c € C}. 
Define g : 5(5 1) — » N by g((c, z, k)) = fc. Then by Proposition 1, 



where z c „ is defined by z c = and = (z£) 2 + c. 

Thus /"X^, =5^', M 1 contains only one cross section s given by: 



s(Si) = {(c,z c n ,n) e CxCxN | c e C,n e N}, 
s(S 2 ) = {(c,z£) eCxC|ceC,neN, |z£| > 2}, 
5(5 3 ) = (ceC I 3 n eN,^| > 2}, 
5(5 4 ) = {(c,0,0)|ceQ, 
5(5 5 ) = {zgC||z|>2}. 



The Mandelbrot set is defined to be the set of complex numbers c such that z c n does not 
tend to infinity. It is known that z c n tends to infinity if and only if |z£| > 2 for some neN. 
Thus s(S 3 ) is the Mandelbrot set. 



(p((c,z,k)) = (c,c + z 2 ,k+ 1). 




M=0 
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4.1.4 Sum 



Let X be a finite set and let (a x ) xeX be real numbers indexed by X. Consider the following 
diagram (5? , 5?' , Jty. 



XxR (1) {san,)xn2 -2 v x 



id 



H2) 



2*xR (3) * 2 - 



M4) 



n\ ocmpl.- 



7I\2 



2 X 



7Z 34 



(5) 



(noff 13 ) 



2 x xRx2 x x 



H6) 



2 X 



(7) 



where the maps are defined as: 

s : X 3 x ^ {x} £ 2 X 
n : 2 X x 2 X 3 (A, B) ^ A n 5 e 2 X 
^:2 x xIx2 x xR3(A,j,fi,y)H(AU5,x + }')e2 x xR 

Suppose that ^ = {5 5 ,5 7 } and that its cross section t is defined by t(S 5 ) = {0} and 
t(S 7 ) = {X}. Let s be a cross section in f(y, y\ M \ t) and assume 



s(Si) = {(x,a x ) | x s X}. 



Then we have 



s(S 2 ) = [{({x}, a x ) | x e X} u ^(5 6 ))] n TrrV^s)) 

= {({*}, a x ) | ieX)U{(Aufi,fl + J) | (A,a,B,b) e s(S 6 ),AUB± 0} (16) 
s(5 6 ) = {(A, a, B, b) | (A, a), (5, 6) 6 s(S 2 ),AnB = 0} (17) 

Let us denote a A = ZxeA ®x for A c X, A ^ 0. 



Proposition 2. 



s(5 2 ) = {(A,a A )|AcX,A*0}. 



Proof. Define s„ = {(A, a) e s(S 2 ) I |A| = n] and f„ = {{A,a A ) \ A c X,\A\ = n) for 
n = 1,2, • • • , |X|. We use an induction on n to prove s n = t n . Suppose s\ 3 (A, a). If 
(A, a) is not of the form ({x},a x ) for some x e X, it is in (^(5(5 6)) and there must be 
(B, b), (C, c) e 5(^2) such that A = B U C and 5 n C = 0. However, it is impossible since 
|A| = 1 and there is no element in s(S 2 ) of the form (0, a). Thus S\ c t n . Since ^ D ^ by 
(16), this proves the case n = 1. Now, suppose n > 2 and s„ 9 (A, a). Since |A| > 2, (A, a) 
must be in tp(s(S 6 )). Then there exist (5, £), (C, c) e 5(5 6 ) such that A = 5 U C, 5 n C = 0, 
and a = b + c. But by the induction hypothesis b = a B ,c = a c so a = a B + a c = a A . 
Thus s n c t n . On the other hand, suppose (A,a A ) e t n and jc e A. Then ({x},a x ) & h = 
S\ c s(S 2 ) and (A \ {x},ax\w) e = c s(S 2 ) by the induction hypothesis. Thus 
(A,a A ) = <p(({x},a x ,A \ {x},a A \ M )) e s(S 2 ). Thus (A, a A ) e s„ and s„ D t n . □ 
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It follows 



s(S,) = {(X,a x )} 
5(5 4 ) = {a x }. 

Thus (y, y", M, 3?,t,S 1,54) represents the map 2Z X : 2 



■ x ({(z,a,)|xeX}) = ||]a,j 



For a general subset AcXxI,I x gives 



S*(A) = |^/(x) 



2 R that satisfies 



s.t. Vjc eX,(jc,/(jc)) eAi. 



4.1.5 Markov Random Field 

A Markov random field (MRF) consists of an undirected graph G = (V, E), a finite set L 
of labels, and an energy function <§ on the space L v of label assignments to vertices, or 
configurations. The energy function must be of the form: 



where denotes the set of cliques in G and E c a function on L v with the property that 
E C (X) depends only on values of X € L v on C. An MRF is of the first-order if E C (X) is 
constant unless C is a vertex or an edge, and thus the energy can be written as: 

g(X) = J^E^Xiv)) + ^ E 2 (u,X(u),v,X(v)) 



veV 



{u,v)eE 



with Ei : V x L -> R and E 2 : V x L x V x L R. We assume that E 2 (u, L, v, L') = unless 
(w, v) is an edge. Solving an MRF involves finding the configuration X with the minimum 

Consider the following diagram (J? 5 ', , M): 



VxL + VxLxVx L (l) >. (V + VxV)x m 



H2)' 



K3) 



AA 



W12 



(id+^12) 



j (id+ff 34 ) 



ltsup : 



VxLxL 



(4) 
V. 



ClTipl O 7T122 



V x L (5) '' 2 > V, 



(6) 



*(7) 
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Define the maps 



E\ = 7Ti x Ei : V x L -> V x R 
£2=ffiXff 3 x£ 2 : VxLxFxL^VxVxR. 

Also, define the map 

i/r:VxR+VxVxR^(V + VxV)xR 

by 

0- = Oi o 7Ti + i 2 O Jtn) X (7T 2 + 7T 3 ). 

Then 

£" = ij/ o (E\ + E' 2 ) : VxL+VxLxVxL— >(V + VxV)xR 

The map 2 v+VxV : (V + V x V) x R -> R is defined in 4. 1 .4. Also, define ltsup : 2 R -> 2 R 
by ltsup(A) = jjceR| 3 yeA,x<}'}, which can be represented by 

R >RxR >Rx2 »-R 

A 

m~ l \ 
2 

where the map <: R x R — » 2 maps (x,y) to 1 if x < y and otherwise, and a partial cross 
section is defined by s(2) = {1}. 

Suppose that J? = {S 4 , Se} and that its cross section t is defined by f(S 4 ) = 0, f(S 6 ) = V. 
Let s be a cross section in r(y, 5*" , ~# \ t). Then, from 

s(S 4 ) = {(v, l u l 2 ) | (v, h), (v, Z 2 ) e s(S 5 )} n c {(v, /, Z) | (v, /) e s(S 5 )} 
= {(v,/i,/2)l(v,/i),(v,/ 2 )e5(5 5 ),Zi */ 2 } 
= 

it follows that there is at most one element (v, /) e s{S^) for each v e V. From s(S6) = 
^2(s(S 5 )) = V, there must be one for each v, thus there is exactly one element (v, /) e s(S 5 ) 
for each v e V. Thus, 5(5 5 ) defines a configuration X 4 . e L v by defining X s (v) = I for each 
(vJ)es(S 5 ). Then, 

5(5 ={(v,X s (v)) \veV} + {(u,X s (u\v,X s (v))) \u,veV} 

s(S 2 ) ={(v, J E 1 (v,X i .(v)))) | v e V} + {((u,v),E 2 (u,X s (u),v,X s (v)))) \u,veV] 

s(S 3 ) ={£(X S )} 

s(S 7 ) ={x e R | x < S(X S )} 

Thus by minimizing on S 7 , mm Si r(y, y", J% \ t) gives the cross sections in s(S 5 ) that 
give the configurations with the minimum energy in s(S 3 ). Obviously, higher order MRFs 
can be treated in the same manner. 
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4.1.6 Finite Automaton 

Let (<2, Z, 6,q ,F) be a finite automaton, where Q and Z are the finite sets of states and 
symbols, respectively, and 6 : Z x 2 — > 2 is the transition function, g e Q is the initial 
state, and F c 2 is the accepting subset of Q. 
Consider the diagram , S" , ^#): 

Z* (1) Z* x Q x N (2) ,,,3. 2 - 1 x <2 (3) - n2 ' 1 -- 2(4) 

■■■■-...^i : ■■■■.■■5* ,: "2 

! id j 

G x N (5) ^ > rxyx N (6) Z* (7) " 0(8, 

Here, Z* is the set of strings on Z and 

I (t, <7, + 1) if r = 6 
6* : Z* x Q x N 9 (t, q, k) ^ I K H ' 

I (cdr(r), £(car(r), q), k + 1) otherwise 

where car(r) e Z is the first symbol in the string r and cdr(r) e Z* is the rest of the string. 
Suppose that 3F = {S i, S 4 , S 5 , 5 7 } and that its cross section t is defined by 

f(Si) = {cr}, t(S 4 ) = F, t(S 5 ) = {(q ,0)}, t(S 7 ) = {e}, 

where a e Z* is a string and 6 denotes the empty string. 
Let s be a cross section in T{S^ , S?' , M \ t). Then 

S(S 6 ) = {(cr,g ,0)}, 

s(S 2 ) = s(S 6 )U6*(s(S 2 )). 
Define g : s(S 2 ) — > N by g((r, g, fc)) = k. Then by Proposition 1, 

oo 

*(S 2 ) = U«T(j(S 6 )) = {(^)"((cr,^,0)) | n = 0,1,2,-..}. 

Thus s(S 2 ) is the whole execution history of the automaton, beginning with (<x, q , 0). 
Therefore, from 

5(5 3 ) = {(e,<?) I (e,q,k) e s(S 2 ),<7 e F}, 
^ 8 ) = (9l?£f, 3 (e,?)es(5 3 )), 

s(S g) is nonempty if and only if the automaton accepts <x. 
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4.2 T\iring completeness 
4.2.1 Turing Machine 

Let M = (Q, 2, 0, ^, 8, q , q A , <7r) be a Turing machine, where Q is the finite set of states, E 
is the input alphabet, is the finite set of internal symbols such that 2 c 0, u is the blank 
symbol in \ 2, 5 : ®xQ — > x <2 x {-1, 1} is the transition function, and q , q A , q R e Q 
are the initial, accept, and reject states, respectively. 
Consider the following diagram (y, , 



(i) 



0* x Q x N x 



'(2) 



2(3) 



Qx 



X 



"234 



id 



"2 



'(4) 



0* x Q x N x 



(18) 



<(5) 



2(6) 



Among the components of 0* x Q x N x N B (cr, q, n, k), the first three represent a configu- 
ration of the machine, with cr the content of the tape, q the state of the machine, and n the 
position of the head. The last component k is a step number. The map 6* : 0* x gxNxN — » 
0* x 2 x N x N is defined so that it updates the configuration of the machine by one step, 
as follows. 

For a string cr, cr[n] denotes the n'th symbol in the string. We use the zero-based index, 
so <t[0] denotes the first symbol of cr. 
First, we define 



77 : 0* x <2 x N x N x (0 x <2 x {-1, 1}) — > 0* x g x N x 



(19) 



Suppose (cr,q,n,k,(x,q',v)) G0*x<2xNxNx(0x<2x {-1, 1}). If q = q A or q = q R , 
T](cr, q, n, k, (x, q', v)) = (cr, q,n,k+ 1). Otherwise, i](cr, q, n, k, (x, q', v)) = (cr', q', n', k + 1), 
where n' = n + v unless n + v < 0, in which case n' = 0. As for cr', it is the string made 
from cr by (i) replacing cr[n] by x if < n < \cr\ — 1, or (ii) appending x if n = \cr\, where 
\cr\ denotes the length of the string cr. 

Next, define 8 : O* x QxM ^ O x Qx {-1, 1} by 



6>(cr, q, n) = 



\S(cr[n],q) if0<n<|cr|-l 
1 6(^, q) otherwise 



for (cr, q, ri) e 0* X Q X N 
Then 8* is defined by 



6*(cr, q, n, k) = T](cr, q, n, k, 9{cr, q, n)). 
Suppose that = {S i, S 4 , 5 6 } and that its cross section t is defined by 
t(S l ) = {cr}, t(S A ) = {(<7o,0,0)}, t(S 6 ) = {q A ,q R }, 
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where cr e E*. 

Let 5 be a cross section in 5?' , M \ t). Then 



s(S 5 ) = {(cr, ? o,0,0)}, 

s(S 2 ) = {(cr,4 ,0,0)}U<f(s(S 2 )). 



Define g : 5(5 2 ) — > N by g((r, g, n, £)) = k. Then by Proposition 1, 



CO 



s(S 2 ) = [J S* n (s(S 5 )) = {(6T(((r, q , 0,0)) |n = 0,1,2, •••}. 



n=0 



Thus s(S 2) is the whole execution history of the Turing machine M, beginning with (cr, q , 0, 0). 
Therefore, if M accepts the string cr, s(S 3) = {q A }; if it rejects cr, s(S 3 ) = {q R }; and if it 
does not terminate, s(S 3 ) = 0. 

4.2.2 Nondeterministic Turing Machine 

The case of nondeterministic Turing machine is similar to the deterministic case. Let 
iV = (Q, 2, 0, ^, 6', qo, q A , q R ) be a nondeterministic Turing machine, where Q, 2, 0, ^, q , 
q A , and q R are as before, and 5' : x Q — > 2 0xex( ~ 1 ' 11 is the nondeterministic transition 
function that gives the set of possible transitions. 

The same diagram (18) as the deterministic case will do with just a modification of the 
definition of 5* : 2 @ ' xQxmN -> 2 * xexNxN as follows: 

6* (A) = {r](cr, q, n, k, t) | (cr, q, n,k) e A,t e 6'(cr, q, n)}, 

where & : 0* x Q x N -» 2 0xex{ - 1 ' 11 is defined by 

(d'(cr[n],q) if0<n<|cr|-l 



and 77 is as before (19). 

The cross section has s(S 9 ) 3 q A if and only if accepts the string cr. 

5 Information Measure of Structure 

We define an information measure of general subsets. The measure takes into account the 
structure of the set characterized by maps, which we call the structure maps. Thus, it is a 
measure of information relative to the structure maps. Essentially, the measure is the size 
of the smallest diagram that represents the given subset, in which the maps in the diagram 
can be written in terms of the structure maps. 




otherwise 
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5.1 Maps Generating Diagrams 

Definition 8. Let be a set of maps. The set of maps generated by j$ is defined 
as follows: 

a) If / is in or it is id, to, or a projection map 7T,-, then / is in (^#). 

b) If maps / : X — > Y,g : Y — » Z are in (^#), then g o / : X — > Z is also in <^#). 

c) If maps /, : X — » Fj (/ = 1, • • • , n) are in (^#), then the product map f\ x • • • x f n : X — > 
Fi x • • • x F„ is also in <^#>. 

Note that for any X and xeX, i:l^Iisa map, which can be contained in the set 
^ of maps in this context. When ^( contains all maps x : 1 — » X, we write X c 

Definition 9. Let be a set of maps. For a map / in (^#), its size relative to M is 
defined as follows: 

a) If / is in jtft , or it is id, to, or a projection map n u then |/| ^ = 1 . 

b) Otherwise, / is either a concatenation or a product of some maps in (^#). In this case, 
\f\j( is 1 plus the minimum of the sum of the size of component maps relative to , 
among all possible combination, i.e., 

i) if / e (^#) is a concatenation, 

|/U = 1 + min (\g\ M + \h\J), 

f=g o h 

ii) if it is a product, 

|/U = 1 + min + • • • + |/„Lr), 

/l, 

/=/lX-X/„ 

Definition 10. Let ^ s be a set of maps. A diagram (J?, S?' , is said to be generated 
by if eacn ma P m ^ is either 

a) the power map / : 2 X — » 2 r of a map / : X — > y in (^#s) or 

b) the inverse map f~ l : 2 Y — » 2 X of a map / : X — > y in (^s). 

Definition 11. Let ^ s be a set of maps. A diagram J?", is said to be generated 
by <Jt$ with complement if each map in ^# is either 

a) the power map / : 2 X — > 2 y of a map / : X — » y in (^#sX 

b) the inverse map J" 1 : 2 ¥ —> 2 X of a map / : X — > y in <^#s) 5 or 
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c) a complement map cmpl : 2 X — » 2 X for a set X in 

Definition 12. Let be a set of maps and 5*", a diagram generated by ^ s with 
or without complement. For a map <p in its size \(p\^ s relative to ^ s is defined as 
follows: 



In the case of the circle example (5), the diagram is generated by „# s = Uen, sub} and 
each map in the diagram has the size 1 relative to The diagram (6) for line is generated 
by {sub, mult}. 

5.2 Definition 

We would like to define a measure of structural information of a subset A c X relative to a 
fixed set of structure maps by considering all data (J^, S?' , M, E? , t, X) that represents 
A and such that the diagram is generated by J%$, and then taking the minimum 

of the total size of the maps appearing in such diagrams. However, such a measure is not 
very useful without any restriction, since for any A c X and any such minimum is 
zero because of the trivial representation in which ,5? = S? = {X}, = 0, and t is defined 
by t{X) = A. 

To remedy this problem, we balance the data supplied by the partial cross section with 
their Shannon information [12]. 

Definition 13. Let X be a set, A its subset, and ^ s a set of maps. Also, let & be a set 
of probability spaces, each space 7 e ^ with a probability measure P Y . We define the 
structural information I (A \ £P)ofA relative to and & as the minimum of 



among all data (S^, <¥" , J{ , 2?, t,X) representing A such that 5*", Jt) is a diagram 
generated by „# s > each set in 2? is in and t(T) is a measurable subset of each T e 5? . If 
there does not exist such a diagram with which the sum is finite, we define I(A \ = 
<x>. We define the measure I C (A \ with complement similarly except that we allow 

diagrams generated by with complement. 

Imagine that X is itself a probability space and X 6 Then the existence of trivial 
representation gives the upper bound: 



a) if (f = cmpl, \ip\^ s = 1. 



b) If <p = f or cp = f 



1 with / e <^s), then \(p\^ s = |/U S . 




^log 2 P T (t(T)) 



I(A\^ s ,^)<-\og 2 P x (A). 
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In general, the sets in 2? can be thought of as parameter spaces, and the Definition 13 
allows for the treatment of information in an ensemble of objects, just like Shannon in- 
formation but taking the structure into account. However, in this paper, we focus on the 
information in an individual object. Thus, for the rest of this paper, we only consider the 
special case of I(A \ ^%, with & = {1}, making 1 a probability space with the prob- 
ability measure Pi defined by Pi(0) = 0, Pi(l) = 1. That is, we require = {1} and 
til) = 1 

Definition 14. The structural information measures I (A | ^# s ) and I C (A | M%) are defined 
as I(A \JZ$,0 s ) and I C (A | J? s , 3 s ) with s = {1}, respectively. 

For instance, the diagram for circle (5) can be modified to: 

1(1) R(2) - V(3) 

! sub -1 

P m.T l * »ri 

1(4) »- -^(5) ■ X x ^(6) ■> ^(7 1 

The constants rel and p e X are identified with the maps r : 1 — > R and p : 1 — > X with 
values r(0) = r e R and p(0) = p £ X (see 2.2.) Remember that we omit the 1 in diagrams. 
Thus the diagram for line (6) can be modified thus: 



. V 



V (1) V x R (2) 



p ; sub o mult 

-^(3) *■ ^ X -^i-li *• -^(5) 

The diagram is generated by = {p, v, sub, mult}. The map sub 1 o mult is actually not 
in (^#s)- However, in general, if we have tp : 2 s — > 2 T and if/ : 2 T —> 2 U , we can always 

think of S - U as an abbreviation of S - 7' - [7 . 

This way, visual patterns such as shown in Figure 1 and 3, as well as explained in 
Section 3, can be shown to have finite information. The circle C has the upper bound 
of I(C \{r, p, len, sub}) < 6, while the line L has I(L\{p,v, sub, mult}) < 7. When we talk 
about Euclidean space, we should include all maps that characterize the space X and related 
spaces of R, V, which would include len, sub, mult and also any constants. Thus, the proper 
set of maps to estimate the structural information of subsets of Euclidean space would be 
something like ^ E = {len, sub, mult} U X U V U R. The above estimates for the circle and 
line cases are unchanged: I(C |^e) ^ 6, 1(L < 7. 



6 Relation to Kolmogorov Complexity 

In this section, we show that the information measure defined in the previous section gen- 
eralizes Kolmogorov complexity. Here, we are only concerned with binary strings; an 
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extension to other alphabet should be straightforward. For a Turing machine M with bi- 
nary alphabet, let the partial function defined by M be denoted by the same letter. Also, let 
\cr\ denote the length of the string cr. 

Definition 15. The Kolmogorov complexity K v {a) of a binary string cr e 2* with respect 
to a universal Turing machine U is defined as 

Ku(cr) = min \p\. 

pe2',U(p)=cr 

6.1 Emulating Turing Machines 

It turns out that any Turing machine can be emulated by a diagram generated by = 
{0, succ}, where succ : N — > N is the successor function and : 1 — > N, if we represent a 
binary string cr e 2* by the subset 

cr = {(i, cr[z]) | i = 0, 1, ■ ■ ■ , \(r\ - 1} U {(|cr|, 0), (|cr|, 1)} C N X N. 

Theorem 1. Let M be a Turing machine with binary alphabet. Then there exists a finite 
diagram {y, ,9" , <JC) generated by = {0, succ} and two sets S = NxN and T = NxN 
in y such that {,9, JZ, {1}, s x ,S, T) represents the map M : 2 NxN ^ 2 NxN that maps 
& to f if M accepts cr leaving a string r on its tape, or to if M rejects cr or does not 
terminate with input cr. 

Proof. Let M = (Q, 2, 0, ^, 8, q , q A , q R ), where Q is the finite set of states, is the finite set 
of internal symbols such that 2 c 0, ^ e is the blank symbol, 8 : 0x<2 — » 0x<2x2is the 
transition function, and q , q A , q R e Q are the initial, accept, and reject states, respectively. 
We define Q = Q + {q e } and = + {□}. Consider the following diagram (,Y, y, Jt): 

; S* °qA , 

6 y y (20) 

N x N ( i) - init --3 N x x Q x N (2) NxGxgx N (3) "~ N x N (4) 

Here, S 2 is meant to represent the whole execution history of the Turing machine M. 

Let us assume that cr is a binary string and s is a cross section in T(y, M \ s{) 
with s(S\) = cr. Fork e N, let us call the set 7^ = {(i,x,q,k) e s(S 2 ) I (i,x,q) eNxQxQ} 
the fc'th row. We call the Rk proper if the following condition holds: 

R k = R k +R k ; 

R{ = x ik, q e ,k)\i = (),■■■ , 4}, x hk = □, x ik + □ for j < l k ; 

R 2 k = {(i, x ik , qk, k)\i = m k ,- ■ ■ , n k \, q k + q e , < m k <n k < l k ; if q k # q A then = 

In a proper row, represents the z'th symbol on the tape at /c'th step in execution of 
M, except for the last entry in the row, which is □. When q k ± q A , the single element 
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(m k ,x mk k,qk,k) in R\ indicates that M is in state q k and its head is at position m k . When 
q k = qA it means that the machine is in the accept state. 
The map init sends the subset & to 

init(cr) = {(i,o-[i],q e ,0)\i = 0,--- , |o-| - 1} U {(|o-|, n^O)} U{(0,cr[0], q ,0)}, (21) 

setting up the O'th row properly. 

The map 6* : 2 Nx ® x 2 xN 2 Nx ® x <2 xN maps R k to R k+l with the following properties: 

1. If R k is proper and q k + q A , 6* updates the configuration by emulating M, i.e., R} is 
copied to Rl +l except that x,„ kk is mapped to x,„ kk+l according toM's tape rewriting, 
and that the single element (m k , x,„ kk , q k , k) of R 2 k is mapped to (m k+ i , x,„ k+lk , q k+l ,k+l) 
in Ri +V where m k+ \ is the position of the head at step k + 1. 

2. If k'th row is proper and q k = q A , R\ is copied to R l k+l and R 2 k is expanded. That is, if 

R\ = {(i, Xik, qA, k) | i = m k , ■ ■ ■ , n k }, 

then 

R 2 k+l = {(/, x ik , q A , k)\i = max(0, m k - 1), • • • , min(4, n k + 1)}. 

In Lemma 6, we show that such a 6* can be represented using a subdiagram. If we define a 
grading map g : Nx0x QxN -> Nby g((i, x, q, k)) = k, it is clear from the proof of the 
lemma that 6* satisfies the requirement of Proposition 1, which is 

oo 

6*(Si)cS i+u 6*(S) = \Jd*(S n ), 

n=0 

where S = Nx0x()xN and 5,- = g~ l (i). Thus from the proposition we have 

oo 

s(S 2 ) = {j6* k (imt( S (S l ))) 

k=0 

We then have the following: 

s(S 3 ) = {(i,x,q,k) e s(S 2 )\q = <7a}- 

If M reaches the accept state at fc'th step, R\ would have one element with the accept state 
q A . The rows after that would have increasingly more q A without changing the symbols, 
until R\, = {(i,x ik ,q A ,k') \ i = (),•••, 4}. The map 6 : 2 Nx ® ^ 2 NxN just fixes the 
termination of the string, removing the blank symbols that might be left at the end of the 
string and properly terminating: 

6(A) ={(i, x) | (i, x), (i + 1, y) e A, x, y 6 2} U {(0, 0), (0, 1) | (0, y) e A, y e B}U 
{(i,x),(i+ 1,0), 0" + 1, 1) | (i,x),(i+ l,y)eA,xe 2,y e B}, 
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where B = { u , □}. The second part of the RHS is to take care of the case when the string 
is empty. Thus, 5(5 4 ) contains the string that remains when M reaches the accept state if it 
does, and is empty if it does not. 

Without loss of generality, we can assume that Q = n,q e = n and = m, □ = m. 
Remember, in our notation, the finite subset {0, 1, • • • , n - 1} of N is denoted by n. Thus, 
constant maps such as q e ,qo are natural number constants. Any natural number constant 
map k can be made from and succ as succ^ o 0. 

It remains to prove that the maps init, 0, and 6* can be represented by a diagram gener- 
ated by ^# N , which is done in the following lemmas. □ 



Lemma 1. The map init : 2 

diagram generated by 

Proof. Consider the diagram 



2Nx@xgxN tnat sat i s fi es (21) can be represented by a 



x 



(idxOr 



(idxir 



xN (2 ) 

i id 



JliT o0 "■, 



(4)' 



idxn 



■oNx© 



•T12 



^13 



0xq 
(3) 

-I 



(5) 



■NxOxgxN, 



(6) 



Let cr be a binary string and s a cross section of the diagram such that s(S\) = cr = 
{(i,o-[i\) | / = 0, 1, •• • ,M - 1} U {(M,0),(|o-|, 1)}. Then 

s(S 2 ) = {(i, x) | (i + l,x')e siSi), x' e 2, (/, x) e s(S 0} = {(i, cr[i\) \i = 0, • • • , H - 1}, 

S(S 3 ) = {(/, q e ) | (i, X) 6 S(S X ), X 6 2} U {(0, a )} = {0', ?e) I i = 0, ■ • ■ , \o\) U {(0, 9o )}, 

5(5 4 ) = {i e N | (i,0), (i, 1) e 5(5 0} = {Mh 

s(S 5 ) = {(/.o-M) | i = 0, 1, • • • , |cr| - 1} U {(|tr|, □)}, 

S(S 6 ) = {(/, o-[i], ft, 0) | i = 0, • • • , \o-\ - 1} U {(M, □, q e , 0)} U {(0, (T[0],q , 0)}. 

Thus, 5(5 6 ) is the result of applying init to 5(5 1). □ 

Lemma 2. The map 6 : 2 Nxe> — > 2 NxN ?/ia? satisfies (22) can Z?e represented by a diagram 
generated by 

Proof. Consider the diagram 



7T13 



x®(D 

; (succ o ffj)x^2 



X0X0 



(2) 



id 



7T12 



x 



(5) 

\ (0,0) 



,■(0,0) ,-(0,1) 

x (6) 

\ (1,0) ,.1.1} 



"23 



/(O,-) /(1,J 

■■>■ N x x (3) r;:r ^ x (4) 



wixO 



X0X0 



(7) 



ffi xl 



F-..(l.°) 



■3nx. 



37 



Here, a = ((succ o m) x n 2 ) 1 o m 2 . Then, for a cross section 5 of the diagram with 
s(Si)=A, 

5(5 5 ) ={(/+l,x)|(*,x)eA}U{(0,0)}, 

5(5 2) ={(i + 1 , x, y) I (/, x), (/ + 1 , y) e A} U {(0, 0, y) | (0, y) e A}, 

5(5 4) ={(x,y)\xe2,yeB}, 

5(5 3) ={(i,x,y)es(S 2 )\xe2,yeB} 

={(/ + 1, x, y) I (i, x), ((+lj)eA,xe2,yeB}U {(0, 0, y) | (0, j)eAje B), 
s(S 6 ) ={(x,y) \x,y £ 2}, 
5(5 7 ) ={(i,x,y) e 5(5 2 ) I x,y e 2} 

={(/ + l,x,y) I (/, x), (/ + 1, y) e A, x, y e 2} U {(0, 0, y) | (0, y) e A, y e 2} 
5(5 8 ) ={(/, x), (/ + 1, 0), (/ + 1, 1) I (i, x), (i + 1 , y) e A, x e 2, y e B}U 

{(0, 0), (0, 1) I (0, y) 6 A, y e 5} U {(/, x) | (i, x), (i + l,y) e A,x,y e 2}, 

where 5 = { u , □}. Thus, 5(5 g) is the result of applying 8 to 5(5 1). □ 

Lemma 3. Let 6 be a map from mxnto mxnx2. Then 8 can be represented by a diagram 
generated by jft-$. 

Proof. Let k be the larger of n and m. The diagram has three parts: 5 a,; = N for i = 
0, • • • ,k—l; Ss,ij = »iXnx#iX#x2for e mxn; and5 c ,o = wxn,Sc,i = mxnx2. 
The first part is as follows: 



■. 



succ H SUCC SUCC H 

*(A,0) ■> > i, \.:; " ' " >■ •W(A.A-l) 



This part has no incoming arrows from other parts. Thus, any cross section 5 has 5(5 A ,;) = 
{/} for i = 0, 1, • • ■ , k - 1. 

The other parts are organized thus: 



m x n (C ,o) >• mxnxmxnx2 



(B,0,0) 

^345 



"J 

_ 77345 

mx nx mx nx 2 (B ,m-i,«-i) ■■■■■ 3 mxnx 2(c,i) 



with maps, for each (/, j) e mx n, as follows: 



*(A,0 ! >- mxnxmxnx 2 {BJJ) < 3 N (A , ( 5 1 (/j)) 

"2 " in.r' '"■■■■-■" 5 . 

WO N(A,fi 2 (/J)) N(A,fl 3 (i,y)) 
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where 6(i, j) = (c*i(z, j), 8 2 (i, j), 8 3 (i, ;')). 

Let s be a cross section of this diagram. Then for each (z, j) e mx n, 

s(Sbjj) = nn~ l (s(Sc fl )) n nf l (s(S AJ )) n ^^(^aj)) 

= {(/, j, ;), 6 2 (i, j), 8 3 (i, j)) e m x n x mx n x2 \ (i, j) e s(S c ,o)} 

_ ({(i, j, j), 6 2 (i, j), 5 3 (i, j))} if (i, j) e s(S c ,o) 
"{© if(/,j)0(S c ,o) 

and thus 

s(S c ,\) = [J ^345(^(5 B,/,y)) 

(i,j)es(Sc,o) 

= {S(i,j)\(i,j)es(Sc,o)} = S(s(S c,o)). 



Lemma 4. 77ze map $ : 2 Nx ° xex2xN -> 2 NxexN defined by 

^(A) = {(0,q,k) \ (0,x,q,0,k) e A} 

U {(/ + 1, q, k) I (z, jc,?,U)eA}U {(z, g, /c) I (j + 1, x, q, 0, /c) e A} 

can be represented by a diagram generated by 
Proof. Consider the diagram 

NxQx. 



'(3) 



("■12x1x^3) 1 

(sUCCOff])Xff23 



Y 



Nx0x£x2xN (1) , MxQxlx N^^^t NxQx N (4) 

iOx.T|X()x--) : * 

: 0X^12 

""""Qx : N (5) 

Then we have 

s(S 2 ) = {(i,q,v,k) I (i,x,q,v,k) s 5(5 1)}, 
s(S 3 ) = {(i,q,k) I (z',x,g, 1,/c) e s(S 
s(S 5 ) = {(q,k)\(0,x,q,0,k)£s(S 1 )}, 

s(S 4 ) = {(z + \,q,k) I (z',x,g, 1,/c) 6 5(5 1)} U {(z',a,/c) | (z + 1, jc, q, 0, /c) e s(Si)} 
U{(0,q,k)\(0,x,q,0,k)G S (S0}. 

Thus we have 5(S 4 ) = ip(s(S 1)). 
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Lemma 5. The map x ■ 2 -» 2 NXN such that 

X({(i,k)}) = {(j,k)\jeM,j*i} 
can be represented by a diagram generated by 
Proof. Consider the diagram 



XJN(i) 



-3 jn x n (2 ) r 

id 



!, „ ■■■■■■■> JN X .W,4, 

Here, s\ = (succ o n\) x 7T 2 and s 2 = ((succ o tti) x 7T2) -1 . Then if s(S i) = {(i, A;)}, 

s(5 2 ) = {a*)l7 = » + l.i + 2,"-}, 
5(5 3 ) = {0-,fe)l7 = /-l,j-2,---,0}, 
5(5 4 ) = {(j,fc)| jeNj^i}. 



□ 



Lemma 6. 77ze map 5* : 2 Nx ® x 2 xN -> 2 NxSx 2 xN wzY/z the properties in the proof of the 
theorem can be represented by a diagram generated by ^# N . 



Proof Consider the diagram: 

N x x Q x n (1 ^ ( ^ °^ N x x Q x 

^123X(SUCC 0^4) : X on i4 



WlX((5off23)XT4 



5 (2)' 



Nx0x gx2x 



x x Q x 



/' ^3 ' (*:>4a) 

(4) 



V 



\7r 3 - 1 o(Q\{9e,?A}) 



N x N (5) 



xQx 



■ 7T 2 o q A 
(7) 



Tl3 



^125 



N x x N (6 ) 



^124 



x0xN, 



xQx 



'(10) 



(S) 



?T2 1 \ 

)NxgxN (11) 



x x 2 x N, 



^134 



x x 2 x 



(9) 



'(12) 



For k e N, let us assume that s(Si)'s &'th row R k is proper. We see that s{S 2 ) contains 
the single element (i,x,q,k),q ± q A in R 2 k and s(S 4) the rest, with the step number k 
incremented. Note that {q e , q A } and Q \ {q e , q A } in the diagram denotes the constant union 
maps 1 — > Q whose images are {q e , q A } and Q \ {q e , q A }. 
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The map^ ° tti 4 sends to 

s(S s )= |J {{j,k)\j±i}. 

(i,x,q,k)es(S 2) 

Thus, s(S 5 ) contains exactly the position-step pair (z',/c) such that there is no element 
(i,x,q,k) in s(S 2 ). The element (i,x,q,k) in i?? indicates that the head is at position z 
on step k. Thus 5(5 5 ) contains the position-step pair everywhere except where the head is, 
and s(Sg) contains the symbols that are not under the head. 

The symbol-state pair (x, q) from (1, x, q, k) in R 2 k is fed to 6 and the result is 

s(S 3 ) = {(i,x',q',v,k) \ (i,x,q,k) € s(S 2 ),S((x,q)) = (x',q',v)}, 

which contains the symbol x' to replace the one at the current position z, the new state q', 
and the direction v to move the head. The new symbol is sent to s(S ^) where it is combined 
with the symbols that are not changed, which are contained in s(S §). The direction to move 
the head is indicated by v: if v = 0, the head is to be moved to the left, and if v = 1 to the 
right. The map if/ in Lemma 4 is defined so that the position is moved accordingly, while 
also taking care of the case (i, v) = (0, 0), when the head is indicated to move to the left 
and the position is 0, and therefore has to stay at 0. Also, without loss of generality, we 
can assume that 5 satisfies S((x, q R )) = (x, <7r, 0), which makes the simulated head to move 
to the left without changing any symbol after the rejection state is reached, until it reaches 
the leftmost position, where it stays. 
Coming back to 5(54), 

s(S 7 ) = [(i,q A ,k) I (i,x,q A ,k) e s(S 4 )}, 

takes position-step pairs with state q A . The map (p '■ 2 NxexN — > 2 NxexN is defined so that 

s(S 10) = <p(s(S 7 )) = {(i,q A ,k) \ (i +l,q A ,k)e s(S 7 )} 
U {(i,q A ,k) I (i,q A ,k) e s(S 7 )} 
U{(z'+ l,q A ,k) I (i,q A ,k) e s(S 7 )}, 

which propagates the positions with accept state q A . This can be represented by 

NxgxN 

Now, the position-state-step triples are combined in s(S n): 

s(S u ) = <AO(S 3 )) U j(5 10) U {(i,q e ,k) \i,ke N}, 
where the last part ensures that the state q e is in R l k everywhere there is a symbol. 



((SUCC Offi)X^23) 

' ?Nx Qx 

(SUCC Off()Xff23 " 
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The states and the symbols are combined in 5(5 9): which is combined with the symbols 
to make 

s(S 9 ) = {(i,x,q,k) I (i,x,k) e s(S 6 ),(i,q,k) e s(Sn)}. 

Finally, the map 77 : 2 Nx0xexN —* 2 Nx0xexN takes care of the case when the head moves 
past the right end of the string: 

T]((i, □, q, k)) = {(i, ^,q,k),(i+l,n,q e ,k)}, (q * q e ,q # q A ), 
rj((i,x,q,k)) = {{i,x,q,k)\ otherwise. 

This can be represented by: 



/ 7T 2 3 1 o (ax(Q\{q e ,q k \)) 



id 



7T[ X„Xff34 



N x x Q x N - N x x Q x N ::::::::::::::::::::::::::::::::::3 N x x Q x N 

(succ o jri)xnxg e x.T4 7, 7, 
id ■"' 

/ w ,-'°(9\{n)) 

"■■■■ ld N x x Q x N 

id 

''*> 

NxOxgxN 

-5 

□ 

Theorem 2. For any universal Turing machine U, there exists a constant c v e N swc/i ?/?a? 

I{& < 6Ku(o-) + c v 

for any binary string o~ e 2*. 

Proof. By Theorem 1, there exists a diagram {y, y', ^) generated by that emulates 
U. By definition, there exists pel* such that U(p) = cr and \p\ = Ku(cr). There are sets 
S = N x N and T = N x N in y, as S 1 and S 4 in (20), such that s(T) = & if s(S) = p, for 
any cross section s of (y, y, ^# | si). Add to the diagram the following: 



, 
"A 



with maps from S A j to S : 



SUCC TI>T SUCC SUCC T^y 

Ha,0) " N ( a,1) ■ ■ ■ ■>• N ( A,|p|) 



_ idxpfi] 

f(A,0 ' 3NX 



for i = 0, 1, . . . , \p\ - 1, where id x p[i] :I->NxN denotes the map id x if p[i] = and 
id x (succ o 0) if p[i] = 1. Finally, we add maps 



idxO 

N (A .ipi) Nx 

idxl 
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from S A ,\ P \ to S . Then any cross section s of the resulting diagram satisfies s(S) = p, and 
thus s(T) = &. 

Maps succ and id x p[i] is added for each symbol in p. The size of succ is 1 and that 
of id x (succ o 0) is 5. Thus the size of the added diagram is at most 6\p\ + const. Since 
the only part of the diagram that depends on cr is the part added here, the existence of this 
diagram proves the theorem. □ 

Corollary 1. For any binary string cr e 2*, I{& zs bounded from above by 

c- min (\p\ + \e M \\Q M \), 

M:TM,pe2* 
M(p)=cr 

where c is a constant and ©m and Qm are the sets of internal symbols and states of Turing 
machine M, respectively. 

Proof. Use the same construction as in the proof of Theorem 2 with any (possibly non- 
universal) Turing machine. It can be seen from the proof of the theorem and Lemma 3 that 
the size of the diagram emulating a Turing machine M is 0(|0 m ||<2mI)- d 

6.2 Reading Cross Section 

Conversely, if a finite diagram generated by can represent & for some binary string cr, 
there is a Turing machine that produces cr and terminate. 

Let X be a set of Boolean variables, i.e., variables that take values in 2. We use the 
standard notion like logical AND (A) and OR (V), treating 1 as true and as false. By 
an assignment to X, we mean a map / : X — » 2, which assigns or 1 to each variable 
in X. If Y c X, there is a natural map 2 X — > 2 Y by restriction that maps an assignment 
to X to an assignment to Y. A logical constraint ^ on a set X of Boolean variables is a 
map^ : 2 X — > 2, which is said to be satisfied by an assignment / e 2 X to X ifx(f) = 1- 
For a logical constraint x on ^> an assignment g to a subset Y of X is said to satisfy x if 
every assignment that restricts to g satisfies it. For a set C of constraints on X, we say an 
assignment to X satisfies C if it satisfies all the constraints in C. 

Theorem 3. Let {5? , S?' , be a finite diagram generated by such that the data {5?, 
, {1}, S\, S) represents & c S = N x N for a binary string cr, where Si is the 
cross section of {1} defined by si(l) = 1. Then there exists a Turing machine that takes 
an encoded description of any such diagram and produces cr. Therefore, for any universal 
Turing machine U, there exist constants du, eu £ N such that 

Ku(cr) < dul(& \Jt n ) + e v 

for any binary string cr e 2*. 
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Proof. For each element t of each set T in y, we define a Boolean variable x t . Also, 
for each set T in y, each map ip G in(T'), and each element ? of 7\ we define a Boolean 
variable yf. Let us denote the set of the variables by X. The variable x t is for indicating 
if t is in the cross section, and yf is to indicate if t is in the image of the cross section by 
(p. We define a set C of constraints on X to establish a one-to-one correspondence between 
the cross sections in T{y , 5?' , M \ s{) and the assignments to X satisfying C. We let C 
contain the following constraints: 

i) For each map <p e 

a) If <p is the power map / : 2 U — » 2 T of / : U — > 7\ C contains the constraint yf = 
for each t eT \ f(U) and, for each f e f(U), 

y v t = V x «- 

b) If 9? is the inverse map f~ l : 2 U — » 2 r of / : 7 1 — » £/, C contains the constraint 

^ = 

for each (eT. 

ii) For each set T in y \ y such that in(T) + and each t in T,C contains the constraint 

* = A y*- 

iii) For each set T in y' such that in(r) ± and each Hn T, C contains the constraint 

* = V y<- 

yein(r) 

iv) For the variable x corresponding to the element 6 1 that appears in the data 
(y, y, {1}, Si,S), C contains the constraint x = 1. 

Note that for each variable x G X, there is exactly one constraint in C with that variable 
on the LHS. We call it% x . Note also that any conjunction in C has only a finite number 
of variables and that there are only countably many variables in X. We fix a one-to-one 
correspondence v : X — > N for later use. 

For a cross section s of y, we define an assignment g s to X by defining g s (x t ) = 1 if 
and only if t e s(T) for each T e y, t G T and g s (yf) = 1 if and only if t e ip(s(dm((p))). 
Then g s satisfies C if and only if s is a cross section in r(,y, y, M | si), as follows. 

First, suppose that g s satisfies C. If T G y \ y, then g s (x t ) = A^einir) g^Cyf) an d tnus 
* g j(D (i.e., g s (x t ) = 1) if and only if t G rUinm ^(dm(^))). If T G then g s (x t ) = 
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Vainer) g s (y?); thus t e s(T) (i.e., g s {x t ) = 1) if and only if t e \J vein(T) ip(s(dm((p))). 
This proves that s is in r{5? , Finally, the constraint g iS (*o) = 1 f° r the vari- 

able x corresponding to the element e 1 ensures that s(l) = 1, and thus s is in 
r(y, M | Sl ). Conversely, suppose that s is in r(Y, y, Jt \ s x ). If T e & \ y, 
g s (x t ) = 1 (i.e., t e s{T)) if and only if t e (~} vein(T) ip(s(dm((p))); since g,(yf) = 1 if and 
only if t 6 ^(dm(^))), g,(x f ) = A^d g,(y?)- If ^ e = 1 (i.e., * e 5(D) if and 

only if t e \J vein{T) (p(s(dm(<p))); thus g s (x t ) = \f^ ia(T) g s (yf). Finally, g s (x Q ) = 1 is satisfied 
since s(l) = 1. 

We define a subset X, of X for / e N by 

X = {x }, 
X i+ i = Xj U {x 6 X I x is forced by X,}. 

Here, a variable x 6 X is said to be forced by a subset y of X either if ^ Y is a disjunction 
and at least one variable on its RHS is in Y or if% x is a conjunction and all the variables on 
its RHS is in Y. Note that, if an assignment g to X satisfies C, g(x ) = 1 and, by following 
the definition, any variable x in X,- has g(x) = 1 . 
We define a function r on X: 



r(x) = 



\i if *6X,-\*m 
00 otherwise 



and using r, we define an assignment hto X by: 

/i(x) = 1 <==> t(x) < 00. 

Then h satisfies C. To see this, assume that there is a constraint^ e C that is not satisfied 
by h. Ifx is of the form x = Ayj with a finite number of y/s, either /z(x) = 1 and h{yj) = 
for some y,-, or /z(x) = and /i(y ; -) = 1 for all y 7 . Neither is possible by the definition of 
h: if h(yj) = for some yj, r(yj) = 00 and thus t(jc) = 00; if r(y 7 ) < 00 for all yj, then all 
y/s are in X^, where k is the maximum of r(y 7 ), and thus t(jc) = /c + 1. If ^ is of the form 
x = Vyj, either h(x) = 1 and h(yj) = for all yj, or /z(x) = and h(yj) = 1 for some yj. 
These are not possible either: if h{yj) = and thus riyj) = 00 for all y ; -, then t(x) = 00; if 
/z(y/) = 1 for some yj, r(y 7 ) < 00 and t(x) < r(y ; ) + 1 . 

For any x in X with t(x) < 00, define a finite subset Y x of X tW as follows: 

x e y x , 

y g Y x , Xy = "J = Ay/' => Vy,- e 7„ 

y e y v , ^ v = "y = Vy/' => arg min v(y y ) e 7*. 

>7,t(vj)<t(v) 

Since only a finite number of variables appear in any conjunction in C, for a variable y 
already in Y x , each rule adds at most a finite number of variables y ; , which all satisfies 
r(y ; ) < r(y). Thus there are only a finite number of variables in Y x . Also, if we denote 
Y l x = Y X C\ X { , each variable in Y\ is forced by Y'~ x for z = 1, • • • , r(x). 
Consider the following algorithm: 
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IsOne(x) 

1 for each increasingly large finite subset Z cX such that x ,x e Z 



2 Y <- {x } 

3 for? = 1,2,- •• 

4 y «- y,-.! 

5 for each zeZ 

6 add z to y if z is forced by Y t _i 

7 until y = y i 



8 until jc e y for some z 

On line 1, we make sure the increasing subset Z will contain each variable eventually by 
adding variable v~ l (k) for k = 0, 1,2, • • • . 

If t{x) < oo, IsOne(x) terminates in finite steps, since Z will eventually include Y x . 
Conversely, if IsOne(x) terminates in finite steps, the set y that contains x is a subset of X t . 
Thus t(x) < oo. 

Let us assume s e r(^, M \ si). Since the data (y, y", M, {1}, s\, S) represents 
cr c S = N x N, any cross section 5' in r(,y, J7", ^ \ s{) satisfies s'(S) = cr. Thus, for 
any assignment g to X that satisfies C, = g s (t) for any t e S . In particular, we have 
/z(/) = g s (t) for any ? e 5 . This means that T(x^b)) < 00 for (/, b) 6 S if and only if cr[/] = 

Now, we define a Turing machine M that executes the following algorithm: 

1 Given (^,J^',^r,{l},j lf S) 

2 Let p be an empty string variable 

3 for i = 0, 1 , • • ■ 

4 run IsONE(Xf) in parallel for t = (z, 0), (z, 1), and (z - 1, 1 - p[i - 1]) if z > 0. 

5 until one of the parallel processes terminates 

6 if i > and IsOne(x(,_i,i_ P [ ! _i])) terminated 

7 then pop p[z - 1] and terminate returning p 

8 else if IsOneCjc^o)) terminated 

9 then p[z] <- 

10 else if IsOne(x (; j)) terminated 

1 1 then p[z] <- 1 

For i = 0, 1, ■ • • , |cr| - 1, IsONE(X(/ i(r [i])) terminates in finite steps but the other processes do 
not. For i = \cr\, either IsOne(x( (! o)) or IsOne(x(; i)) can terminate first. For i = \cr\ + 1, only 
IsOne(X( I _u_ -[ I _i])) terminates. Thus, M always terminates returning cr. 

Finally, let (y, y, {1}, si,S) be the diagram that has size I{& |^# N ). Since the 
data can be encoded in a string of length 0{I{a |~#n)) an d emulating M on U to run on the 
string produces cr, there are constants du, eu e N such that 

^rj(cr) < dul(a |^# N ) + 

that do not depend on cr. □ 
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By Theorem 2 and Theorem 3, we can say that is equivalent to the Kol- 

mogorov complexity Ku(cr). Thus, the formulation gives the larger class of objects a 
meaningful measure of information that generalizes Kolmogorov complexity. 

7 Discussion and Conclusion 

How is the information measure we defined different from Kolmogorov complexity in the 
general case, not the case in the previous section? Let us take the class Sf of subsets of 
Euclidean plane X as an example and try to follow the common prescription: that is, we 
fix an encoding of the objects into strings; and then we define the Kolmogorov complexity 
of the string encoding an object as its complexity. Consider 

u f 

S — » S -^W (23) 

where S is the set of all strings and U is the partial map defined by a universal Turing 
machine. The map /' is an encoding of objects by strings. The common notion is that we 
should define the length of the shortest string s such that x = f'(U(s)) as the Kolmogorov 
complexity of x. However, as we mentioned in 1.1, because the cardinality of 9? is larger 
than that of S , we need to either encode only some of the objects, encode multiple objects 
by each string, or both. To allow for both possibilities, we consider 

S S' J- <g (24) 

instead. Here, S' = S + {|}, where f(x) = 1 means "x is not encoded." For an object x, 
the length of the shortest string s such that f(x) = U(s) is defined as the amount Kf tU (x) of 
information in x, unless f(x) = T, in which case Kfjj(x) = oo. 

The question is: does there exist a map / that makes Kfjj(x) equivalent to I{x 
The trivial answer is: yes, we can define / using I(x |^#e)- If K x I-^e) is finite, we define 
f(x) to be the first string s (in some standard order) such that the shortest string s' with 
U(s') = s has length I(x |^#e); otherwise we define f{x) = f. But this only highlights the 
meaninglessness of this kind of discussion without restricting / in some way: as we noted 
in the introduction, just about any "complexity" can be realized this way. 

Let us instead try to define a reasonable encoding / to see if it gives rise to anything 
close to I(x |~#e)- First, a standard way to encode points in X would be to identify X with 
K 2 and encode the two coordinates of a point by an infinite sequence. We can also encode 
a countable set of points by a sequence similarly by dovetailing between more and more 
points, enumerating the digits for each to higher and higher precision. To accommodate 
this, we again modify (24) a little and consider 

5-^5^^ (25) 
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Here, S is the set of all infinite sequences. If a sequence s is computable, there is a Turing 
machine that, given a natural number n, prints the first n symbols of s and halts. Let U 
be the partial map that maps an encoding of such a Turing machine to the sequence. We 
define the length of the shortest string s' such that U(s') = f(x) as K^q(x). If there is no 
such string s', we define Kjq(x) = oo. Note that this particular scheme is already quite 
different from ours. According to this scheme, the information in a single point varies in 
a way irrelevant to anything we may be interested in while thinking about subsets of a 
Euclidean plane. If the sequence corresponding to the point is uncomputable, it has infinite 
information. In contrast, for any point p in X, I({p} |^#e) = 1, as we saw in section 5. 
We begin to see how we introduce unnecessary complications by trying to first encode 
everything by a string. 

How about other, uncountable sets? Most of geometric objects are uncountable subsets 
of X. We can imagine encoding lines with a pair of points and circles with a point and a 
real number, etc., encoded as above. That is, we define f so that a line on X going through 
a pair of points is mapped to a sequence encoding two points and some flag indicating that 
it is a line. This is a definition of / by using characteristics, or regularity, of the subset. 
However, the problem is that any interpretation of the sequence in terms of regularity must 
be incorporated in the definition of /. Thus, as we wish to add more patterns — lines, 
circles, repeated patterns — the definition of / has to become more and more elaborate; 
and we would wish to somehow define / computationally. However, this cannot be done 
because the spirit of using Kolmogorov complexity is that any information, including any 
regularities and redundancies, is conveyed through a sequence and any computation must 
be done by a Turing machine, strictly on the left side of S in (25). 

In order to define / computationally, we would have to define the notion of computation 
involving And, of course, that is exactly what our scheme does, where the situation is 
like 

S 3> & (26) 

A given set of maps determines the possible set of diagrams generated by the maps. 
The map h is the interpretation of diagrams into the ground representation. The difference 
between the two schemes is on which side of the center the computation, or the "decom- 
pression," takes place. It occurs in U in (25) and in h in (26). 

Thus, it is not easy to give a reasonable and simple definition of / that makes Kf o(x) 
equivalent to /(x|~#e)- We cannot seem to define a simple encoding / that reflects the 
informal notion of information in the domain of subsets of Euclidean plane X. Even with 
this single, relatively simple class of objects, we have these problems. Remember that 
we cannot just say that there is an encoding and only talk about strings, as we pointed out 
repeatedly; if we insist encoding objects into strings, we need to define a concrete encoding 
for each class of objects. 

The new representation provides a meta-definition for that purpose. It allows specifying 
how to embed multiple computations in larger spaces in a versatile way. As shown in the 
previous section, a Turing machine can be simulated by a diagram generated by {0, succ}. 
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This can be isomorphically embedded in a diagram generated by by using p and move v 
instead of and succ, where p e X and move v : X — » X is a move by a vector v, which can 
be represented by a diagram. Although a computation can be in other more direct forms, 
as in the examples in 4.1, this shows that universal computation can be embedded as a part 
of the representation. 

Also, it allows separating the information that we wish to ignore from that which we 
measure. As we have seen, this is important as some information we want to ignore is 
infinite. The structure maps delineate the a priori parts of the objects whose information 
are to be ignored, leaving only the information in the structure: hence the name "structural 
information." 

Finally, the representation can provide a useful abstraction in designing real-world ap- 
plications. In dealing with high-dimensional signal-level objects, such as sounds, images, 
and sensory readings in robots, we need to model the process of finding patterns in raw 
signal and grouping them together to be described at a symbol level. The information mea- 
sure may be used to regularize the process of finding useful patterns in the ground level. 
We can imagine implementing a system using this representation, which corresponds to 
defining an encoding g of diagrams shown in (26). 

Given a fixed set of structure maps, it can represent infinite sets by finite symbolic 
expressions and sparse parameters. That is, after fixing the sets and maps as primitives, 
diagrams can be symbolically represented just as graphs are. While the fixed sets such as 
Euclidean spaces and the space of real numbers have to be implemented by some system- 
dependant approximation such as floating point numbers, the structure itself is preserved 
irrespective of such approximations. In the case of the line example, although points and 
vectors might be translated between systems varied in, say, resolution, its structure, its 
"fineness," so to speak, would survive. In the current practice, an encoding of a line by 
such sparse data would have to be written into the code; leading to the lack of generality 
and flexibility. 

Also, diagrams can be naturally understood as modules to construct larger and more 
complex ones. The set union and intersection are the most basic ways to combine them; or 
we can put the output of one diagram into the input of another. Ultimately, any computation 
can be used to combine them. Our formulation provides a useful abstraction that allows us 
to model such structures in a uniform way so that they can be manipulated automatically. 

Conclusion 

In this paper, we introduced a uniform representation of general objects. In an abstraction 
of the dense representation, which includes strings, bitmaps, and other raw data, the ob- 
jects are assumed to be given a priori as subsets of some sets. The proposed representation 
uses a new construct called diagram to represent objects with regularity through sparse 
parameters, using the maps that characterize the space in which the objects are included 
as subsets. Since the representation can emulate any computation, it can exploit any com- 
putable regularities in objects to compactly describe them. It is also general enough to 
represent random objects as raw data, making it possible to interpolate between the dense 
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and sparse representation. There is a prescribed way to connect the description to the raw 
data; thus, the representation is grounded. In other words, the relationship between the pa- 
rameters and the data is part of the representation so that we can give raw data and then ask 
what sparser, more structured representation is possible. With the representation, we also 
defined a measure of information in the objects. We proved that the measure is equivalent 
to Kolmogorov complexity in the case of strings. 

To answer our question in the introduction, we would say that a subset A of a set X with 
structures characterized by a set j$ of maps is a pattern if I(A \^) is significantly smaller 
than |A| in the case A is finite and, in the case A is infinite, if I(A \^) is finite. Even when 
I(A \^) is infinite, there can be a pattern in a statistical sense, which we leave for future 
work. 
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